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5 POLYNUCLEOTIDE ENCODING A GENE CONFERRING RESISTANCE 

TO BACILLUS THURINGIENSIStOXINS 

10 STATEMENTAS TO RIGHTS TO INVENTIONS MADE 

UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 
The United States Government may have rights to this Invention under 
the terms of a sponsored research agreement by the National Science 
Foundation, grant number MCB-g81 6056. 
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RELATED APPLICATIONS . 
This application claims the benefit of US provtsidnai application having 
serial number 60/276,180 filed on March 15, 2001, and which is incorporated 
herein by reference. 



FIELD OF THE INVENTION 
This invention is directed towards the occurrence and identification of 
pesticide tolerance of certain insects. The invention makes use of specific 
polynucleotide sequences associated with the onset of resistance to Bacillus 
25 thuringiensis toxins which are used as probes to monitor the presence of 
acquired insect resistance assodated with transgenic crops. The specific 
polynucleotide sequences are also used to nK)nitor changes in the 
frequencies of alleles which confer the resistance to the toxins. 

30 BACKGROUND OF THE INVENTION 

The bacterium Sac///us thuringiensis (Bt) contains genes encoding 
insecticidal proteins. Bt proteins are toxic when ingested by susceptible 
insect larvae. The protein attacks the insect's midgut, causes cessation of 
feeding, and eventually kills the insect Bt toxins have been prc>duced as 

35 fermentation products of Bt cultures and used in spray formulations for crop 
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protection. Bt genes have also been used commerdaliy to trahsfomi crop 
plants; these transgenic crop plants* cells then produce the insecti^ai protein 
which attacks susceptible insects that attempt to feed on the plant. 

The general mode of action of Bt toxins is well known in the art and is 
5 described for example by Rajamohan F, Lee MK. Dean DH {^996) Progress 
in Nucleic Acid Research and Molecular Biology 60: 1 -27. The protein 
produced by the bacterium is usually a protoxin, which Itself is not toxic until it 
is proteoiyticaliy cleaved by the insecfs own proteases. The smaller protein 
resulting from proteolysis is the active toxin. This toxin diffuses through the 

10 peritrophic membrane to the midgut epithelium, where it binds to one or more 
sites in the membrane. This initial binding step may be reversit>le, but 
eventually the toxin becomes irreversibly bound to the membrane. A 
conformational change occurs in the toxin, whereby membrane^spanning 
alpha helices are inserted into the merribrane, where they aggregate and form 

15 pores. These pores disrupt the normal osmotic balance of the epithelial cells. 
The cells swell and lyse, leading to destruction of the midgut epithelial cell 
layer and eventual death of the insect 

The initial binding step is believed to be necessary for toxin action; 
consequentiy there have been many studies on binding interactions of Bt 

20 toxins and componente of the midgut, described for example by Pietrantonio 
PV and Gill SS (1996) in Biology of the Insect Midgut, Chapman & Hall, 
London, pp 345-372. Techniques used to study binding often start with the 
isolation of brush border membrane vesiclfes (BBMVs) from the microvillar 
portion of columnar epithelial cells. Binding to BBMVs in suspension can be 

25 measured using labeled toxin. Alternatively, proteins can be isolated from 
BBMVs, separated by denaturing electrophoresis conditions, transferred to 
membranes, and probed with toxin. In addition, histological sections of insect 
midguts can be prepared and binding of labeled toxin can t)e visualized using 
microscopy, 

30 Binding of Bt toxins to specific insect proteins can also be measured. 

Several proteins that interact with Bt toxins are well known in the art; 
Aminopeptidases exist in many different forms in insect midguts, and many of 
them havie been shown to bind Bt toxins (Knight PJK, Knowles BH, Ellar DJ 
(1995) Journal of Biological Chemistry 270 (30): 17765-17770; Gill SS, 
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Cowles EA, Francis V (1995) Journal of Biological Chemistry 270 (45): 27277- 
27282; Luo K. Sangadala S. Masson L, Mazza A, Brousseau R, Adang M J 
(1 997) Insect Biochemistry and Molecular Biology 27 (8-9): 735-743). 
Members of the cadherin superfamiiy have also been shown to bind Bt toxins 
5 (Vadlamudi RK, Weber E, Ji IH. Ji TH. and Bulla LA (1995) Journal of 

Biological C/jem/sf/y 270: 5490-5494; and Nagamatsu Y, Koike T. Sasaki K, 
Yoshimoto A, Funikawa Y, (1999) FEBS Letters 460: 385-390). Phosphatase 
enzymes have also been implicated In Bt toxin binding (Sangadalia S, Walters 
FS, English LH, Adang MJ, (1994) Journal of Biological Chemistry 269 (13): 

10 1 0088-1 0092). TPP-75, an elastase-like serine protease, binds to certain'Bt 
toxins and causes them to precipitate (Milne RE, Pang ASD, Kaplan H (1995; 
Insect Biochemistry and Molecular Biology 25 (1 0): 1 1 01 -111 4). BTR-270, a 
peptWoglycan, binds Cry1 A toxins with high affinity (Valaitis AP, Jenkins JL, 
Lee MK, Dean DH. Gamer KJ (2001 ) yArc/j/ves of Insect Biochemistry and 

15 P/iys/o/ogry 46 (4): 186-200). Bt toxins have also been shown to bind to 
nonprotein components of midgut epithelial membranes. Glycolipids from 
Manduca sexta have been shown to bind Cryl A toxins using an overlay 
technique (Garczynski SF and Adang MJ (2000) in Entomopathogenic 
Bacteria: From Laboratory to Field Application, K\uwer Academic Publishers, 

20 pp 181-197). Neutral lipids are involved in Bt toxin binding to Manduca sexto 
brush border membranes (Sangadala S, AzadI P, Carlson R, Adang M J 
(200i; Insect Biochemistry and Molecular Biology 32 (-{)'. 97-107). Neutral 
glycolipids, especially hexa- and tri-sacchaiylceramides, are implicated in 
Cryl A toxin binding in diamondback moth (Kumaraswami NS, Maruyama T, 

25 Kurabe S, Kishimoto T, Mitsui T, Hon H, (2001 ) Comparative BioGhemlsUy 
and Physiology B- Biochemistry & Molecular Biology 122 {1): 173-183). 

The relationship between binding targets for Bt-foxins and susceptibility 
or resistance to Bt is very complicated and not Gompieteiy undei^ood at the 
present time. Several hundred strains of Bacillus thuringiensis exist, with 

30 cx>nsiderabte specific^ toward various groups of insects. Co^olutbn 
between the insects and Bt has resulted in spedlioity of the interaction 
between Bt^oxin and the membranes of insect gut cells. The Bt-toxih of a 
particular strain of Sacr//us &}u/7/?g«nste may bind to file gut of s^ 
tefvae but not to ©there. Thuis, the Bt-^^ high spec^city lora 
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small number of insect pest species while having no significant activity against 
beneficial insects, wildlife, or humans. - - 

Plants transfonned to carry Bt genes and express insecficidal proteins 
are known in the art and include potato, cotton, tomato, com, tobacco, lettuce, 
5 and canola^ Transformed plants are known in the art as reflected in US 
Patent Nos. 5,608,142; 5,495,071 ; 5,349,124; and 5,254,799, the 
specifications of which are incorporated in their entirety herein by reference. 
The use of genetically engineered plants is designed to reduce the use of 
broad spectrum insecticides. 

1 0 There is concem that resistance may evolve to Bt toxins, whether they 

are applied to plants in spray formulations or the plants are genetically 
engineered to express them. The development of resistance to Bt-toxin 
expreissing crops may also result in resistance to commercial fomnuiations of 
fermented strains of Bt such as DIPEL® (Abbott Laboratories). 

15 Rapid, reliable methods for broad screening to distinguish and detect 

the development of Bt resistance In populations of insects are needed. 
Heretofore, ail methods require living or fresh-frozen insect larvae or 
preparations derived from them. The simplest methods employ bioassays on 
living insects, in which survivorship or larval metabolic rates are determined 

20 over time following a diet containing a specified concentration of a Bt-toxin. 
One such bioassay based on reduced metatx>lic rates after exposure to low 
doses of toxin mixed into artificial diet Is discussed in US Patent No. 
6,060,039 to Roe et aL which is incorporated herein by reference. Other 
bioassays are based on survival after exposure to a single, high diagnostic 

25 dose of toxin (for example, Diaz-Gomez O, Rodriguez JC, Shelton AM, 

Lagunes-T A, Bujanos-M R. (2000) Journal of Economic Entomology 93 (3): 
963-970). 

In principle, these bioassay methods can detect resistance no matter 
what its biochemical or physiological mechanism is. However, they require 
30 living, healthy larvae for their use, which are not alwa^ available. A more 
severe limitation on these methods is that, depending on the frequency of 
resistance genes in the populations^ millions of individuals may need to be 
tested to detect a single resistant larva. High-level resistance to Bt is usually 
recessive, which means that an insect must have two copies of the resistance 
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gene to be resistant To a very good approximation, the frequency of such 
homozygous individuals is given by the square of the frequency of the 
resistance allele. For example, if the resistance allele frequency is one in a 
thousand, the frequency of homozygous resistant individuals Is one in a 
5 millbn. In this example, more than a million larvae would need to be 
screened to detect resistance. 

One solution to this problem is to develop methods for detecting the 
resistance genes directly. In the example just given, the frequency of 
heterozygous carriers of one cx)py of the resistance allele Is 2 x 0.001 x 0.999 

10 or approximately 2 in a thousand. When resistance is recessive, these 

individuals would not be identified by bioassay because the one resistance 
allele they carry is not enough to make them fully resfstant. But a direct, 
DNA-based method for detecting the resistance allele would identify these 
individuals, and sample sizes on the order of a thousand, rather than a million, 

15 would suffice. 

The main limitation to developing DNA4)ased methods for detecting 
resistance alleles is that, up to now, the identity of resistance-causing genes 
has been unknown. In spite of much work on Bt toxin mode of action, prior to 
the invention described herein there has not l>een a demonstration of which 

20 genes, when mutated, actually cause resistance. Acxx>rdingly, there is room 
for variation and improvement in the art of screening assays useful In 
detecting the presence of genes inferring Bt resistance in natural 
populations. 

25 SUMMARY OF THE INVENTION 

It is one aspect of one of the present ihventtons to provkle a geniatic 
probe to identify and monitor resistanc;e for the Bt-toxin in target insect 
populations. One such insect pest is the tobacco budworm {Heliothis 
virescens) which is a major economte pest of cotton. 

30 it Is yet another aspect cyf one of the present inventions to develop a ' 

DNA probe and assay protocol which distinguishes between the conditions of 
tiomozygotes and heterozygotes with respect to resistance to Bt in 
populations of He/tofAi/s Wre^cens arKi other in^ 

It is yet another aspect of one of the presetit inventions to proSnde a_ 
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process and useful sequences in which nucleotide probes are used to monitor 
the presence of acquired insect resistance associated with a transgenic crop. 

It is yet another aspect of one of the present inventions to provide a 
pnDcess and useful nucleotide sequences whidi are usi3d to monitor 
population changes in the frequency of alleles which are associated vwth the 
resistance to Bt toxin. , 

These and other features, aspects, and advantages of the present 
invention will become better understood with reference to the following 
description and appended claims. 



BRIEF DESCRIPTION OF THE DRAWINGS 
A full and enabling disclosure of the present invention, including the 
best mode thereof, to one of ordinary skill In the art, is set forth more 
particularly in the remainder of the specification, including reference to the 
15 accompanying drawings. 

Figure 1 is a QTL map of the Cryl Ac resistance trait on linkage group 9 
of Heliothisvirescens. 

Figure 2 is a conceptual transiatton of HevGaLP (si allele and r1 allele) 
in alignment with BmBtRI 75 of Bombyx mon and BtR1 of Manduca ^Bxta. 
20 Figure 3 is a northern analysis of mRNA isolated from susceptible and 

resistant strains following probing with the gene sequences set forth herein. 

Figure 4 sets forth the Insertion point of the HeU element in the r1 
allele of HevCaLP. 

Figure 5 shows the multi-copy occurrence of HeM in genomic DNA of 
25 resistant and susceptible sirains of Heliothis vifBseens. 



BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 
The accompanying sequence ID listings are identified below^ The 
30 sequence listings appear following the claims and are incorporated herein by 
reference. 

The first sequence 1 identffies SEQ ID NO: 1 which is the DNA 
sequence of the susceptible allele si of HevCaLP. 

Sequence 2 is the protein sequence SEQ ID NO: 2 of a conceptual 
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translation of allele s1 as used in the protein alignment to Bombyx and 
Manduca. . ~ - 

Sequence 3 is the DNA sequence of SEQ ID NO: 3 which is the 
resistant allele r1 of HevCaLP, including the Hel-1 insert and the duplicated 
5 target sequences. 

Sequence 4 is the DNA insert identified as SEQ ID NO: 4 for the Hel-1 
insert which does not include duplicated target sequences. 

Sequence 5, having SEQ ID NO: 5, is a DNA sequence corresponding 
to the left LTR of the Hel-1 insert. 
10 Sequence 6, having SEQ ID NO: 6, is a DNA sequence corfespondfiig 

to the right LTR of the HeM Insert. 

Sequence 7, having SEQ ID NO: 7, is a DNA sequence of primer F1 
corresponding to bases 1 982 to 2001 of SEQ ID NO: 3. 

Sequence 8, having SEQ ID NO: 8, is a DNA sequence corresponding 
15 to primer R2 consisting of the reverse complement of bases 4322 to 4351 of 
SEQ ID NO: 3. 

Sequence 9, having SEQ ID NO: 9, is a DNA sequence corresponding 
to primer R3 consisting of the reverse complement of bases 2029 to 2052 of 
SEQ ID NO: 3. 

20 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
Reference now will be made in detail to the embodiments of the 
invention, one or more examples of which ^re set forth l>elow. Each example 
is provided by way of explanation of the invention, not limitation of the 

25 Invention. In fact, it will be apparent to those skilled in the art that various 
modifications and variations can be made in the present invention v\nthout 
departing from the scope or spirit of the invention. For instance, features 
illustrated or described as part of one embodiment, can be used on another 
embodiment to yield a still further emtx>diment. Thus, it is intended that the 

30 r present invention cover such modificatidns and variations as come within the 
scope of the appended claims and their equivalents. Other objects, feaftires, 
and aspects of the present invention are disdosed in the follo>Aflng detailed 
description. It is to be understood by one of ordinary skill in the art that tiie 
present discussion is a description of exemplary enribodiments only and is not 
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intended as limiting the broader aspects of the present invention, which 
broader aspects are embodied In the exemplary constructions/ ■ " ^. ' ■ 

In describing the various figures herein, the same reference numbers 
are used throughout to describe the same material, apparatus or process 
5 pathway; To avoid redundancy, detailed descriptions of much of the 
apparatus once described in relation to a figure is not repeated in the 
descriptions of subsequent figures, although such apparatus or process is 
labeled with the same reference numbers. 

Applicants' protocols and procedures may be found in reference to 

10 "Identification of a Gene Associated witli Bt resistance in Heliothis virescens'' 
which was published in Science, volume 293, pp 857-860, on August 3, 2001 ; 
and which is incorporated herein by reference. 

A resistant strain of Hefiothis virescens was previously developed in 
the laboratory by selection using artificial diet containing various 

15 concentrations of Bt toxin (Gould F, Anderson A, Reynolds A, Bumgamer L, 
Moar W (1 995) Journal of Economic Entonuylogy 88 (6): 1 54&-1 559). The 
strain, named YHD2, is 10,000 fold more resistant to the toxin CrylAcand Is 
conditioned in a large part by a single recessive gene named BtR-4 which is 
located In linkage group 9 of H. virescens. The initial localization of the 

20 : resistance gene BtR-4 has been reported In the Applicants' prior publication 
(Meckel DG, Gahan LC, Gould F, Anderson A (^997y Journal of Economic 
Entomology 90: 75^86) and which is incorporated herein by reference. 

Further localization of BtR-4 to a particular region of linkage group 9 
was carried out using a total of 1 1 polymorphic markers spanning a length of 

25 105 cM. The markers were scored on a segregating backcross family derived 
from YHD2 females crossed with susceptible males. The linkage group was 
scanned for quantitative trait loci (QTLs) confemng Bt resistance following the 
methods of Lander, ES and Botstein D (1989) Genetics 121: 185-193. A 
single, highly significant peak of the log-likelihood function indicated that the 

30 BtR-4 resistance gene is located between A14 and MPI as set forth in Fig, 1. 
The cadherin superfamily was chosen as a candidate for BtR-4. 3 
Partially degenerate oligonucleotide primers Bmtp5 and Bmtp8 as shown in 
Table 1 were designed based on published sequence of the BtRI 75 gene 
from Bombyx mori (GenBank Accession No AB026260, described by 
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Nagamatsu Y, Toda S, Koike T. Miyoshi Y, Shigematsu S, Kogure M (1 998; 
Bioscience, Bioteclinology and Bioctiemistry 62 (4): 727-734)/ These primers 
were used in the polymerase chain reaction (PGR) with cDN A prepared from 
midgut mRNA of larval Heliothis virescens. A PGR product of 334 basepairs 
5 designated HvcadSS was amplified, cloned and sequenced using 
conventional methodology well-known to those skilled In the art. The 
sequence of Hvcad58 corresponds to bases 4279 to 4612 of SEQ ID NO: 1 . 

Radiolabeled Hvcad58 was used to probe Southern filters made from 
additional segregating backcross families for further mapping on linkage 

10 group 9. Finer scale QTL mapping in this region using 268 backcross 

progeny yielded a single peak of the log-likelihood function directly above the 
map location of Hvcad58 (Fig. 1). The data clearly Indicates that the gene 
containing Hvcad58 is a strong candidate for the BtR-4 resistance gene. 

The Hvcad58 probe was used to screen midgut cDNA libraries made 

15 from resistant (YHD2) and susceptible strains of Helioiiiis virescens. Clones 
recovered from these libraries were sequenced and used to design additional 
primers to amplify the full-length coding sequence from susceptible cDNA. In 
addition to the cDNA methods, a five-prime RACE (rapid amplification of 
cDNA ends) technique was used to complete the full sequence. 

20 The sequencing yielded one transcript (si) doned from a susceptible 

strain as given in SEQ ID NO: 1 . Conceptual translation of this transcript 
produced a protein product (that we have named HevCaLP, Heliottiis 
virescens cadherin-like protein) of 1 732 ariiino acids as given in SEQ ID NO: 
2. HevCaLP is 70% identical to the BtR176 protein, sharing a signal 

25 sequence at the amino terminus, 1 1 extra-cellular cadherin-type repeats, a 
non-oadherin proximal membrane region, a transmembrane regfon, and a 
highly conserved cytoplasmic domain at the carboxy terminus as shown in 
Fig. 2. It shows somewhat less similarity to the BT-R1 protein from Manduca 
sexta, as given in GenBank Accession No. AAB33758 and reported by 

30 Vadlamudi RK, WeberE. JliH,Ji TH, and Bulla UK (1995) Jot/ma/ p 

Biological Chemistry 270: 5490-5494. The transmembrane and cytbplasmic 
domains are absent from that sequence of BT-R1 . 

Expression of the mRNA encoding HevCaLP iri susceptible and 
resistant larval midguts was studied using northern analysis and sequeridng 
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of Clones from the lesiistant library. As shown in Fig. 3, susceptible larvae 
show a single transcript of 5.5 kb. YHD2 larvae show three tra'nsciTpb. The 
sequence of the rarest (7.9 kb) is denoted as the r1 allele, and given as set 
forth in SEQ ID NO: 3. It is similar to the susceptible transcript except for a 
5 2.3 kb insert denoted as Hel-1 as given in the accompanying SEQ ID NO: 4. 
Hel-rl shows several hallmiarks of thie LTR-type retrotransposbns. Hel-1 has 
an approximately 255 nucleotide long temninal repeat (LTR) sequence at both 
endsand an unrelated sequence in theniiddle. The left LTR sequence, 
LTRa, is given in SEQ ID NO: 5 and the right LTR sequence, LTRb, is given 

10 in SEQ ID NO: 6. Hel-1 is flanked by an 8-nt duplication of tiie host sequence 
ACACTGCC, as shown in Fig. 4. The transcript of intermediate abundance 
(4.4 kb) is an abbreviated fomi, truncated at the second LTR of Hel-1 by a 
poly^ taH. The third, highly abundant transcript (2.1 kb), is truncated at the 
first LTR of HeM by a poly-A tail. 

1 5 Because of an in-frame stop codon 30 bases into the first LTR of Hel-1 , 

conceptual translation of the three different YHD2 transcripts produces the 
same truncated 622-aa protein (as shown in the translation of the r1 allele in 
Fig. 2). Multiple stop codons in all three reading frames of the LTR follow the 
initial stop codon, preventing translation of a larger protein containing the 

20 cartx}xy-tenninus of HevCaLP. Thus, the predicted protein product of the 
YHD2 r1 allele (if one Is produced) would possess the same signal sequence 
as HevCaLP (possibly directing its secretion into the midgut lumen) but no 
predicted transmembrane domain or toxin-binding region. 

Genomic Southern blots probed with the LTR region of Hel-1 show that 

25 it occurs with a copy number of 1 0-1 5 in both YHD2 and susceptible insects 
(Fig. 5). Insertion of this Hel-1 element into the gene encoding HevCaLP has 
created the novel, knockout r1 allele which confers resistance when 
homozygous (present in two copies in an individual insect). This insertk>n 
event could have occunred in the laboratory during the Bt-resistance selection 

30 protocol that produced YHD2, or may already have been present in the field^ 
collected founders of the selection line. Thus it is now evident that a DNA- 
based method for detecting Bt resistance in Heliothis vitBsciens may be 
devised, t)ased on detection of the specific insertion of the HeM element into 
the gene encoding HevCaLP. producing the n allele: 
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To Illustrate detection of the ri allele, a PGR assay was designed using 
two primers flanking the Insertion point (F1 and R2) and a third (R3) Internal to 
the left LTR (Fig. 4). Primer F1 consists of bases 1982 to 2001 of SEQ ID 
NO: 3, 5' ATA CGA GCT GAC GAC ACG CTG GGA GA 3', primer R2 
5 consists of the reverse complement of bases 4322 to 4351 of SEQ ID NO: 3. 
5' TCT GAG GGT AGG AGG TGT GTT G7T GAT GTG 3', and primer R3 
consists of the reverse complement of bases 2029 to 2052 of SEQ ID NO: 3, 
5' GCG CGA TGT GAC AGT CCG GAA CAG 3\ Primers F1 and R3 produce 
a 71 -bp band from the r1 allele. Primers F1 and R2 amplify a 99-bp band 
10 from s1 or other susceptible alleles lacking the Hel-1 Insert. Heterozygotes 
produce both bands. This is a marked improvement on a conventional 
bloassay, which would not distinguish heterozygotes f rom homozygous 
susceptibles because the resistant allele Is recessive. It also confirms that the 
resistant strain is fixed for the r1 allele, as all YHD2 individuals examined to 
15 date have the 71 -bp band only. It will be evident to those skilled in the art that 
the detection method for the r1 alelle Is not limited to PGR with these specific 
primers, and that there are many other molecular methods of detecting the 
specific insertion of the Hel-1 element into the HevGaLP gene, based on the 
sequence information disclosed herein. 
20 It is believed that the gene encoding HevGaLP Is identical to BtR-4. the 

major resistance gene in YHD2. Recessivity of the resistant allele at BtR-4 is 
explained by Hel-1 Inactlvation of HevGaLP. HevGaLP functions as a "lethal 
targer of Bt-toxin, since two copies of the disrupted allele are required for 
10,000-fold resistance. Heterozygotes still present a ''lethal targef since they 
25 haveone copy of the susceptible allele. 

The normal physiological function of HevGaLP is unknown, although 
other members of the cadherin superfamily are involved in cell adhesion and 
signalling (T. Uemura (1998) Ce//93 (7): 1095-1098). Whatever its function, it 
is not essential for life, as YHD2 is viable and fertile under laboratory 
30 conditions, despite being a "natural knockour strain for HevGaLP. Whether 
its absence confers a fitness disadvantage in the field has important 
implications for resistance management and this question cain now be 
addressed with the informatfon developed here. Target-site resistance to 
other insecticides usually involves modification but not knockout of the target. 



wo 02/074079. 



12 



PCT/US02/67872 



which is generally essential for life (e.g., acetylcholinesterase for 
organophosphates, sodium channel for pyrethroids, GABA receptbf for 
cyclodienes) (French-Constant RH, Pittendrigh B, Vaughan A, Anthony N 
(1998) Philosophical Transactions of the Royal Society of London Series 
5 Biological Sciences 363 (1376): 1685-1693,). However, methoprene 
resistance in Dmsophila me/andgasfer provides another example of 
resistance by gene inactivation (Wilson TG & Ashok M, (1998) Proceedings 
of the National Academy of Sciences of the USA 95 (24): 14040^14044). 

The present invention now makes possjble the application of molecular 

10 methods to Bt-resistance monitoring. We previously estimated the frequency 
of YHD2-type resistant alleles in field populations of Heliothis virescens prior 
to widespread planting of transgenic Bt-cotton to be :Q.002 (Gould F, 
Anderson A, Jones A, Sumeiford D, Meckel DG, Lopez J, MIclnski S, Leonard 
R. Laster M (1997) Proceedings of the National Academy of Sciences of the 

15 USA 94 (8): 3519-3523). This labor-intensive, bioassay-based estimate was 
derived by testing progeny of more than 1 ,000 field-caught males mated to 
YHP2 females, for alleles which woukl confer resistance when heterozygous 
with r1 . Our results now suggest that this estimate covers the entire class of 
HevCaLP knockouts regardless of the nature of the molecular lesion, as well 

20 as other mutants preventing any expressed HevCaLP from functioning as a 
toxic target Development of efficient DNA-based methods to detect ttiese 
otiier types of mutants at BtR4 should be a high priority and is now possible 
with the methods described herein. 

Only by monitoring allele frequencies at resistance genes like BtR-4 

25 will It be possible to verify that the high-<Jose/refuge resistance management 
strategy for Bt-cotlon mandated by Hie US Environmental Protection Agency 
(EPA) is actually working to keep resistance allele levels low. The present 
invention affords a new method of complying with EPA regulations which 
require monitoring resistance levels in Heliothis viresqens. The present 

30 invention provides a nucleic acid probe that will specifically identify genes for 
resistance in field populations. Furttier, the probes and protocols set forth 
herein provide for a method of monitoring the population of homozygous and 
heterozygous resistant individuals in field populations. 

Bt resistance in Heliothis \nrescens caused by other types of mutations 
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that inactivate the HevCiaLP gene product may also be screened foruslng 
the information provided herein. Such methods may include obtaining 
portions of the gene or Its homologues by cDNA cloning or the polymerase 
chain reaction, determining the DNA sequence by standard methods, and 
5 examining the sequence for the occurrence mutatfons that may include 
nucleotide substitution, insertions, or deletions. Such mutations may affect 
protein sequences encoded by the gene by causing amino acid substitutions, 
insertions, or deletions as well as incorrect intron splicing, premature chain 
termination due to nonsense mutations, or errors in the normal Initiation or 

10 termination of the transcription or translation. 

By way of example, DNA or RNA isolated from individual Heliothis 
virescens is used as the template for PGR using prmiers specifically designed 
from SEQ ID NO: 1. The PGR products are directly sequenced, or cloned and 
sequenced, using standard methods. The sequences are examined using 

15 commercially available computer programs well known in the art, such as the 
Wisconsin Genetics Computer Group package. Mutations, such as individual 
nucleotide substitutions, insertions, or deletions; or insertions or deletions of 
several nucleotides, are detected by comparison to SEQ ID NO: 1 . Such 
mutations may alter the amino acid in the protein sequence, leading to 

20 reduced binding of Bt toxins to the HevGaLP gene product and thereby 

confening resistance. Or such mutations may cause frameshifts or premature 
occurrence of stop codons, resulting in a truncated or absent protein thiat fails 
to bind to Bt toxins and thereby confers resistance. 

In the course of this inventioni an isolated nucleic acid molecule of the 

25 present invention includes a nucleic acid that is at least about 85%, preferably 
at least about 90%, and still more preferably at least about 95%, and even 
more preferably at least about 99% identical to the sequence of the 
susceptible allele s1 of HevGaLP. Additionally, any isolated polynucleotide or 
naturally occurring polynucleotide that hybridizes to the sequence set forth in 

30 SEQ ID NO: 1 at 60''C in IX SSC will have properties useful In carrying out 
the present invention. 

Other embodiments of the present Invention include isolated hucleic 
acid molecules that are at least about 85%, preferably at least about 90%, still 
mors preferably at least about 95%, and even more preferably at least about 
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99%, identicaJ to the sequences set forth in SEQ ID NO: 3 and SEQ ID NO: 4. 

Bt resistance in other insect species may also be screened for using 
the same approach. These species may contain one or more genes 
homologous to the Heliothis virescens HevCaLP gene, whose products 
5 interact with Bt toxins. Resistance in these other species can be detected by 
obtaining the sequence of those genes, designing PCR primers, and 
amplifying and sequencing DNA from individual insects collected from the 
field or reared in the laboratory. Examination of the sequence for inactivating 
mutations as described herein will detect Bt resistance in those species 
10 Representative sequences of HevCaLP homologues in other spedes and ■ 
which may be used in the screening process described herein include the 
following: 

1) Manduca sexta BT-R1. GenBank Accession No. 177078. US Patent No. 
5693491 (SEQ ID NO: 1) and US Patent No, 6,007,981 (SEQ ID NO: 1); 
15 2) Bombyx mori BtR1 75, GenBank Accession No. AB026260, described by 
Nagamatsu Y. Toda Koike T, Miyoshi Y, Shigematsu S. Kogure M (1998) 
Bioscience^ Biotechnology and Biochemistry 62 (4): 727^734; 

3) Pectinophora gossy^iella BT-R2, GenBank Accession No. AX150183, 
Patent Application. International Publication No. WO01/34807 (SEQ ID NO: 

20 1); 

4) OsMnia nubilalis, GenBank Accession No. AX147201, Patent application. 
International Publication No. WO 01/36639 (SEQ ID NO: 1); 

5) He//coi^e/pazea, GenBank Accession No. AX1 47203, Patent application, 
International Publication No. WO01/36639 (SEQ ID NO: 3); 

25 6) Spodopferafrug/peAt/a, GenBank Accessfon No. AX1 47205, Patent 

application. International Publication No. WOO/136639 (SEQ ID NO: 5); and 
7) Ly/na/7/na cf/sparBTR-CAD, GenBank Accession No. AF31 7621. 

The above identified sequences and the referenced publications are all 
incorporated herein by reference as is set forth in their entirety. 

30 ;. The current methodology includes detecting resistance to Bacillus 

thuringiensis endotoxin in insect populations by screening for mutations that 
alter the structure or function of a protein as set forth in SEQ ID NO: 2. For 
the purposes of screening protocols, it is believed that usirig the sequence set 
forth in SEQ ID NO: 2 may include homologues and other species which 
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would display at least 60% similarity to the sequence set forth in SEQ ID NO: 
2. More preferably, the sequence similarity is at least about 75%, preferably 
at least about 80%, more preferably at least about 85%, even moire preferably 
at least about 90%, still more preferably at least about 95%, and even more 
5 preferably at least about 99% identical to the amino acid sequence set forth in 
SEQ. ID. NO: 2. 

Several of the mutations in other species detected by this approach 
may not have an obvious effect of activating the HevCaLP homologue. In that 
case, evidence that the mutation confers resistance may be obtained by 

10 conducting a linkage analysis and mapping the gene as described herein for 
Heliothis virescens. For that purpose, a strain of the species of interest with 
the mutation is crossed with a wild-type strain, and the F1 hybrids are 
intercrossed or backcrossed to one of the parental strains. The F2 or 
backcross progeny are tested for resistance by any of the bioassay methods 

15 described previously and well known in the art, and DNA is isolated from each 
individual progeny. The DNA is analyzed for the presence of the mutation, 
using restriction fragment polymorphism analysis, allele-specific PGR, 
denaturing gradient gel electrophoresis, single-stranded conformation 
polymorphism, denaturing high-perfomiance liquid chromatography, or any 

20 other mutation detection system well known in the art. Evidence that the 
mutation confers resistance is obtained from the correlation across progeny 
between presence of the mutation and presence of resistance. 

A straightfoPArard extension of this method of detecting Bt- resistance is 
to examine the DNA sequence of genes encoding other proteins that interact 

25 with Bt toxins. Including but not limited to aminopeptidases, alkaline 
phosphatases, elastirhlike serine proteases, and peptldoglycans. 

All cited references, publications, and sequence listings set forth herein 
are incorporated by reference In their entirety. 

These and other modifteatlons and variations to the present invention 

30 may be practiced by those of ordinary skill In the art^ without departing from 
the spirit and scope of the present invention. In addition, it should be 
underetood that aspects of the various embodiments may be interchanged 
both in whole or In part. Furthennore, thoseof ordinary skill In the art will 
appreciate that the foregoing description Is by way of example only, and Is not 
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primers ' Used i 


Bmtp 5 


5'- < 


Bmtp 8 


5'- ' 


HvtpOS 


5'- . 


HvtpOS 


5'- < 


CQpl 


5'- ' 


CGnotpf2 


5'- < 




AGG 


T7pl- ' 


5'- 


T7p2 


5'- 


rrapi 


5'- 


T3p2 


5'- 


CGp3 


5'- 


CgRot:p4 


5'« 




TGA 


RC36T4 


5'- 


RC36cg5 


5'- 


RC36cg5+ 


5'- 


SC3T6 


5'- 


SC3cg7 


5'- 


C6p5 


5'- 


CGnotp6 


5'- 




6GA 


Sintl 


5'- 


Sint2 


5'- 




5'- 


RUNI2 


5'- 


Hvtpl3 


5'- 


Hvtpl4 


5'- 


HvtplS 


5'- 


Hvi:pl6 


5'- 


Hv1:pl7 


5'- 


HvAll-RT 


5'- 


5'R5M/8 


5*- 




AAT 


HvAllFl 


5'- 


HVA11F3 


5'- 


HVA1XR2 


5'- 


HVA11R4 


5'- 


USTPOl 




USTP02 


5'.- 


USTP03 


,5'- 


F771-5Kspl 


5'- 


F77l-5Ksp2 


5'- 


738-3. 4Kapl 


5'- 


738-3.4Ksp2 


5'- 


738-3. 4Kap3 


5'- 


738-3 «4Ksp4 


5't 



In De-bermining the Structure of BtR4, the Cadherlii-«X±jce 
Polynucleotide in Heliothis virescens 

6TR CTG ACK CTT AAY ATC GAG CCC ACK GC —3' 
TAG 6GG YAC ROT RTC SC6 KAT GAA GT6 KCC -3' 
AGC CCA CTG CAT CTA TGC ACG GCA TGT TTG A —3' 
CCT GAC TTC GGT CTG GTC GTC CCT GGC —3' 

TGT GGA GTC AGC TTC CAT AGA GTC TTG TAT CAG CGT GTA —3' 
GAT ACG CGG CCG CAG GTC AGC AGA -GCT CTG TTG ATG GTG TCG 
GTG GAG A -3' 

TAA GTT GGG TAA CGC CAG GGT TTT CCC AGT CAC —3* 
GGC CAG TGA ATT GTA ATA CGA CTC ACT ATA GGG CG ~*3 ' 
GAT AAC AAT TTC ACA CAG GAA ACA GCT ATG ACC ATG —3' 
GAA ATT AAC GAC CCT TAA AGG GAA CAA AAG CTG GAG ->3 ' 
GGC ACG TTT TTT TCC ACT OAC GGG GTC QTG CG — 3 ' 
GAT ACG CGG CCG CGG GCA GTC TGA GC6 TAG GAG GTG TGT TGT 

GAC GTG TGT TCG CCT GAT CCT AAC TAC T -3 ' 
AGC CTC TTA AAT CCA TAG CGG TCT CCA G -^3 ' 
CTG GAG ACC GCT ATG GAT TTA AGA -3' 
ATG TTC GAG GTG CTG TAC CTC ACC G —3' 
ACA CGA ACA CAG GAT CGT GGA AGT T -3' 
TGT ATC TTC TGG AAC TCC GGC ACT TCG AAG ^TC —3' 
GAT ACG CGG CCG CAT GTG ATG GTT CTG CGT GCC GAC GAT GAA 
CTG -3' 

GCT AAG GAC CGG GAT ATT GAT GAT AGA GT —3' 
CGT GCG GGG CAG TCT GAG AGT AG —3' 
CAT ACA CGA CCG CAC GCG CAA CG -3' 
TGA GCG CCG AGG TGC AGG TGT AGG --3' 
CTG TAC ACA GCC GGC ATC TCC AC -3 ' 
CTG GAA GTT GAG GGT CAG CAC TCC AGT —3' 
AAC CGT CGT CTG GAA GCT CT —3' 
TCT TCG ATG CCG ATC AGA TCC GAG TC -3 ' 
GCG GCG CCG GGC ACC AAC, AAG CA —3 ' 
AAT AGA TGC TCT TAG ATA ATA CGA GTA TCT TAC —3 ' 
GAT ACG CGG GCG CGA GAA CTA TGA GAT GGC AGT CGA CGT GAG 
A —3' • 



CCC GAA CTT RTT GTT ATG GTT TGC TAC TGA -3* 
ATG GGC AAC GCA GTT AAC TAG CTG -3' 
CAT CCT. CGT GAC AAT CGA CGA TGC 3' 
CAG ACA GAA CGA GCT CTT TGT GCA —3 ' 
GCC GTG CAG CAG TTC GAT GAG AAG —3 ' 
CTC CCA CTG TAT CAG TAG CCA TCA —3' 
ACA ATC CTT CAG GGT TCG AGC CAT C -3' 
GTA CAA GAG AAA ATC GCG CGT TGC GT —3' 
CCT GAT CAA CTG GAA CGA TGA GCT 6 -3' 
CCA AAG TCC ACG GGC GGT TGC GCA C —3' 



TABLE 1 
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Priiiers Used in Determining the Structure of BtR4, the Cadherin-likei 
'■ PoXynuciebtide ' in Beliothis viresdens <Gontinued) 

738-3. 8sp6 5'- GTG TAA C6T ACT GTG CTC iSTG TAA TGC —3' 
738-C10sp8- : 5'- CCG TCT GAA ACA TGT CGA AGT CAT -3 ' 

TBROl 5'- GAG ACT- AGO ACC TAG ACG GTC GCT -3' 

TBR02 5'- TCC AAC GAG GTG TTC CTG GTG ACG— 3' 

CR9TfiR • 5'- CAC TGT TAC TGT CAA TGT. TCG AGA —3' 

MR-Prl 5'- CAC ACG TCA TCG TGC GCC CCA CCT AAG CTG —3' 

LTR-Pr2 5'- CTG GCG CGA CCT CAT AGG CCG GCG CGA TGT —3/ 



IiTR-1.9Kspl 

I.TR-1.9Ksp2 

Rint-Fwdl 

Rint-Rev2 

C36RE6EQ-P 

C36RESEQ-R 

pcllFa 

pcl2Pa 

pcl2Ra 

pcl3Fia. 

pcl3Ra 

pcl4Fa 

pcl4Ra 

pclSRa 

RlntS-Fi 

RintS-R2 

RintR-.R3 

RintR-F4 



5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'- 
5'w 

5'- 
5'- 

S't 



CGA 
GTA 
ATA 
TCT 
CCC 
CTC 
GAG 

cec 

T6G 
GAA 
TGC 
AAC 
CGA 
TTC 
ATA 
TCT 
GCG 
ACG 



ATC 
GTG 
CGA 
GAG 
GGC 
CAT 
ATG 
GTT 
TAC 
CAC 
CAG 
CCG 
ACT 
CTT 
CGA 
GAG 
CGA 
CGC 



AGC 

TGT 

GOT 

CGT 

ACC 

GGT- 

GCA 

TCG 

CTC 

GGC 

GCT 

CTG 

GCT 

CCA 

GCT 

CGT 

TGT 

AAC 



TGA 
GAT 
GAC 
AGG 
GAC 
CGT 
GTC 
CCG 
GGT 
GAA 
TAA 
CAT 
GCA 
CGT 
GAC 
AGG 
GAC 
GCG 



TTC 
GTG 
GAC 
AGG 
AAC 
ATG 
GAC 
TGT 
AGT 
ACG 
CTA 
TTG 
CGG 
CAT 
GAC 
AGG 
AGT 
CGA 



ATT 
ATC 
AC(3 
TGT 
TCC 
CCT 
GTG 
TCA 
TAA 
GGC 
CCG 
TTT 
CGA 
TGT 
ACG 
TGT 
CCG 
TCT 



GTC GCT 
CAG -3 ' 
CTG GGA 
GTT GTd? 
-3' 

TGA CAT 
AGA ATA 
GGA ATG 
GCC TGG 
ACC ACA 
AGG TAC 
AGA GTT 
AGA TCT. 
CGC CAT 
CTG GGA 
GTT GTT 
GAA CAG 
ACT CTT 



-3 ' 

GAG CC -3 ' 
GAT GTC —3' 

GTA -..3 ' 
CTG A -3 ' 
TC -3''- "^^ 
CAA T —3' 
GA -3' 
CA —3' 
ACA G --3' 
CCA T — 3^ 
ATT T —3' 
GA -3 ' 
GAT GTC —3' 
-3' 
-3' 



TAiLE i 
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THAT WHICH IS CLAIMED IS: 

1 . An isolated poi^ucleotide sequence selected from "the group 
consisting of SEQ ID NOs: 1 , 3. 4, 7, 8 and 9, 

2. A nucleotide sequence having at least an 85% identity of one of 
the sequences set forth In Claim 1 . 

3- An isolated nucleic acid sequence encoding the amino add 

sequence of SEQ ID NO: 2. 
4. Any isolated polynudeotide or naturally oGcunirig polynucleotide 

that hybridizes the sequence in Claim 1 at 60^'C in 1X BSC. 
5- An isolated polynucleotide encoding an allele of a gene which 

when homozygous confers resistance to Bacillus thuringiensis 

endotoxin comprising the sequence of SEQ ID NO: 3. 

6. A nucleotide sequence having at least an 85% identity to the 
sequence set forth in Claim 5. 

7. An isolated polynucleotide encoding a mobile genetic element 
capable of being inserted into the genomic DNA of Insects, comprising 
the sequence of SEQ ID NO: 4 . 

8. A nucleotide sequence having at least an 85% Identity to the 
sequence set forth in Claim 7. 

9. The complement of any of the sequences set forth in Claim 1 . 

10. A method of detecting resistance to Bacillus thuringiensis 
endotoxin in Heliotllis virescens populations by screening for the 
presence of mutations having a sequence selected from the group 
consisting of SEQ ID NO: 3 or SEQ ID NO: 4. 

11 - A method of detecting resistance to Bacillus thuringiensis 

endotoxin In insect populations by screening for mutations tiiat alter the 
structure or function of any protein encoded by the nudeotide 
sequence set forth in SEQ ID NO: 1 . 

12. A method of detecting resistance to Bacillus thuringiensis 

endotoxin in insect populations by screening for mutations that alter the 
stmcture or function of any protein having at least a 60% similarity to 
the sequence of SEQ ID NO: 2. 
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13- A method for detecting mutations in genes from insect 
populations by screening for the presence of insertions oTa"DNA 
sequence set forth in Claim 8. 

14- A process for monitoring Bt resistance associated with the 
presence of an r1 allele in an insect population associated with 
transgenic crops comprising the steps of: 

obtaining DNA from an individual insect; 

amplifying said DNA using primers having nucleotide sequences 
of SEQ ID NO: 7, SEQ ID NO: 8, and SEQ ID NO: 9. 

measuring the molecular size of said amplified DNA, thereby 
detemnining whether said individual Is a susceptible, a heterozygote, or 
a homozygote for said r1 allele. 

15- A process for determining Bt resistance in an insect species 
containing one or more genes homologous to the HevCaLP gene 
comprising the steps of: 

identifying a first gene encoding a cadherin-IIke protein; 

identifying a second gene, said second gene a mutant of said 
first gene, said mutant gene associated with Bt resistance; 

producing a set of primers for PGR amplification of a sample of 
DNA containing either one of said first gene or said second gene, said 
set of primers having at least one individual primer unique for a 
mutated portion of said second gene; 

amplifying DNA from an insect using said set of primers; 

separating said amplified DNA according to size; 

determining from said separated DNA whether said individual 
insect is a homozygote, a heterozygote, or a susceptible individual with 
respect to said mutant gene. 
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SEQUENCE LISTING<110> Clemson University<120> 
Polynucleotide encoding a gene conferring resistance to Bacillus 
thuringiensis toxins<130> CXU-352~PCT<160> 9 <170> Pa tent In version 

3.1<210> 1<211> 5355<212> DNA<213> Heliothis virescens<22t)><Z2i> 
CDS<222> (10) . . (5208) <223> 

<300><301> Linda J. Gahan and David G. Heckel<:3 02> Identification of a gene 
associated with Bt resistance in Heliothis virescens<303> Science<304> 
293<305> 5531<306> 857-860<307> 2001-08-03<3 08> GenBank AF367362<309> 
2001-08-12<313> (1) . . (5355)<300><308> GenBank AF367362<309> 2001-08- 
12<313> (i) . . {5355)<400> 1 

aactatgag atg gca gtc gac gtg aga ata ctg acg gca gcg gta ttg att 51 
Met Ala Val Asp Val Arg lie Leu Thr Ala Ala Val Leu lie 
1 5 10 

etc get get cat tta acg gtc gcg caa gat tgt tct tac atg gta gca^ 99 
Leu Ala Ala His Leu Thr Val Ala Gin Asp Cys Ser Tyr Met Val Ala 
15 20 25 30 - 

att cct aga cca gag cga cca gat ttt cct aat caa aat ttc gaa gga 147 
lie Pro Arg Pro Glu Arg Pro Asp Phe Pro Asn Gin Asn Phe Glu Gly 
35 40 45 

gta cca tgg agt cag aac ecc ctg tta cca gcg gag gat agg gaa gat 195 
Val Pro Trp Ser Gin Asn Pro Leu Leu Pro Ala Glu Asp Arg Glu Asp 
50 55 60 

gtg tgc atg aac gcg ttc gac ccg agt gee ttg aac ccc gtc ace gtc 243 
Val Cys Met Asn Ala Phe Asp Pro Ser Ala Leu Asn Pro Val Tlir Val 
65 70 75 

ate ttc atg gag gag gag ate gaa ggg gac gtg gee att gee agg ctt 291 
lie Phe Met Glu Glu Glu lie Glu Gly Asp Val Ala lie Ala Arg Leu 
80 85 90 

aac tac cga ggt acc aat act ccg ace gtg gta act cca ttt aac ttt 339 
Asn Tyr Arg Gly Thr Asn Thr Pro Thr Val Val Thr Pro Phe Asn Phe 
95 100 105 110 

ggt acc ttc cac ttg ttg ggg ccg gtc ata cgt a^g ate ccc gag caa 387 
Gly Thr Phe His Leu Leu Gly Pro Val lie Arg Arg lie Pro Glu Gin 
115 120 125 

9SS 999 9ac tgg cat ctt gtt att acg cag agg cag gac tat gag acc 435 
Gly Gly Asp Tip His Leu Val He Thr Gin Arg Gin Asp Tyr Glu Thr 
130 135 140 

ccg aac atg cag cag tat ate ttc aac gtg agg gta gaa gac gag ccc 483 
Pro Asn Met Gin Gin Tyr He Phe Asn Val Arg Val Glu Asp Glu Pro 
145 150 155 

cag gaa gee act gtg atg etc ate att gtc aac att gac gac aac get 531 
Gin Glu Ala Thr Val Met Leu He He Val Asn He Asp Asp Asn Ala 
160 165 170 

cct ate ata cag atg ttc gag cct tgt gac att cct gaa cac ggc gaa 579 
Pro He He Gin Met Phe Glu Pro Cys Asp He Pro Glu His Gly Glu 
175 180 185 190 

acg ggc acc aca gaa tgc aag tac gtg gtg age gat get gac ggc gag 627 
Thr Gly Thr Thr Glu Cys Lys Tyr Val Val Ser Asp Ala Asp Gly Glu 
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195. 



200 



205 



ate age- aca cgt: ttc atg acg ttc caa atd gag age gat cga aac gat 
He Ser Thr Arg PEe Met Thr Phe Gin He Glu Ser .Asp Arg Asn Rsp" 
210 215 220 

gaa gaa tat ttc gaa cte gtc aga gag aat att eag gga eag tgg atg 
Glu Glu Tyr Phe Glu Leu Val Arg Glu Asn He Gin Gly Glh Trp Met 
225 230 235 



675 



723 



tac gtc cat atg agg ctt ate etc aac aaa ect ctg gac tat gag gaa 
Tyr Val His Met Arg Leu He Leu Asn Lys Pro Leu Asp Tyr Glu Glu 
240 245 250 



771 



aac ccg ctg cat ttg ttt aga gtt aca get ttg gat tec eta eca aac 
Asn Pro Leu His Leu Phe Arg Val Thr Ala Leu Asp Ser Leu Pro Asn 
255 260 . 265 270 



8l9 



gtt cac aca gta acg atg atg gtg caa gta gag aac ata gag age aga 
Val His Thr Val Thr Met Met Val Gin Val Glu Asn He Glu Ser Arg 
275 280 285 



867 



cca eca egg tgg atg gag ate ttc gee gtg eag eag ttc gat gag aag 
Pro Pro Arg Txp Met Glu He Phe Ala Val Gin Gin Phe Asp Glu Lys 
290 295 300 



915 



aca gca caa gee ttc agg gtt cga gee ate gat gga gac acg gga ate 
Thr Ala Gin Ala Phe Arg Val Arg Ala He Asp Gly Asp Thr Gly He 
305 310 315 



963 



gat aaa cet att ttc tat agg ate gaa act gaa gaa age gag aaa gat 
Asp Lys Pro He Phe Tyr Arg He Glu Thr Glu Glu Ser Glu Lys Asp 
320 325 330 



1011 



ttg ttc agt gtt gaa aca ata gga get ggt egg gaa ggt get tgg ttt 
Leu Phe Ser Val Glu Thr He Gly Ala Gly Arg Glu Gly Ala Trp Phe 
335 340 345 350 



1059 



aaa gtc get eca ata gac agg gac act eta gaa aag gaa gtt ttc cac 
Lys Val Ala Pro He Asp Arg Asp Thr Leu Glu Lys Glu Val Phe His 
355 360 365 



1107 



gtg tct eta ata gcg tac aaa tat ggc gac aat gac gtg gaa gga agt 
Val Ser Leu He Ala Tyr Lys Tyr Gly Asp Asn Asp Val Glu Gly Ser 
370 375 380 



1155 



ccg tea ttc gag teg aag ace gat ate gtc att att gtg aac gac gtg 
Pro Ser Phe Glu Ser Lys Thr Asp lie Val lie He Val Asn Asp Val 
385 390 395 



1203 



aat gat eag gcg ccg gtg eca ttc cgt ect tea tac tac att gaa att 1251 
Asn Asp Gin Ala Pro Val Pro Phe Arg Pro Ser Tyr Tyr He. Glu He 
400 405 410 

atg gag gaa get gcg atg aca ttg aat tta gag gac ttt ggt ttc cac 1299 
Met Glu Glu Ala Ala Met Thr lieu Asn Leu Glu Asp Phe Gly Phe His . 
415 420 425 430 

gat aga ggt ctt ggt ccg cac gca eag tac aca gtg cac ctg gag age 1347 
Asp Arg Gly Leu Gly Pro His Ala Gin Tyr Thr Val His Leu Glu Ser 
435 ~ " ~440 ~" " .445 
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ate tec ecg gcg gga geg cac gag gcg ttc tac ate gcg ccg gag gtg 1395 
lie Ser Pro Ala Gly Ala His Glu Ala Phe Tyr He Ala Pro Glu Val ■ 
450 455 460 - - 

ggc tac cag cga cag tec ttc ate gtc ggc acg cag aac cat cac . atg 1443 
Gly Tyr Gin Arg Gin Ser Phe He Val Gly Tlir Gin Asn His His Met 
465 470 475 

etc gac ttt gaa gtg cca gag ttc caa aag ata caa ctt agg gcg gta = 1491 
Leu Asp Phe Glu Val Pro Glu Phe Gin Lys He Gin Leu Arg Ala Val 
480 485 490 

gee ata gac atg gac gat cec agg tgg gtt ggt ate gcg att ata aac 1539 
Ala He Asp Met Asp Asp Pro Arg Trp Val Gly He Ala He He Asn 
495 500 505 510 

att aac ctg ate aac tgg aac gat gag ctg ccg ate ttc gag cac gac - 15 8-7 
He Asn Leu He Asn Trp Asn Asp Glu Leu Pro He Phe Glu His Asp 
515 520 525 

gtg cag act gtc acc ttc aag gag acg gag ggc get ggc ttc egg gtc 1635 
Val Gin Thr Val Thr Phe Lys Glu Thr Glu Gly Ala Gly Phe Arg Val 
530 535 540 

gee act gtt ctg gca aag gac agg gat att gat gat aga gtc gaa cat 1683 
Ala Thr Val Leu Ala Lys Asp Arg Asp He Asp Asp Arg Val Glu His 
545 550 555 

tct eta atg ggc aac gca gtg aac tac ctg agt ate gac aaa gac acc 1731 
Ser Leu Met Gly Asn Ala Val Asn Tyr Leu Ser He Asp Lys Asp Thr 
560 565 570 

ggc gac ate etc gtg aca ate gac gat gca ttt aac tat cac aga cag 1779 
Gly Asp He Leu Val Thr He Asp Asp Ala Phe Asn Tyr His Arg Gin 
575 580 585 590 

aac gag etc ttt gtg cag ata cga get. gat gac acg ttg gga gag ccg 1827 
Asn Glu Leu Phe Val Gin He Arg Ala Asp Asp Thr Leu Gly Glu Pro 
595 600 605 

tat aat acc aac acc gee caa ctg gtg ata cag ctg caa gac ate aac 1875 
Tyr Asn Thr Asn Thr Ala Gin Leu Val He Gin Leu Gin Asp He Asn 
610 615 620 

aac aca cct cet acg etc aga ctg cct cgc acg act ccg tea gtg gaa 1923 
Asn Thr Pro Pro Thr Leu Arg Leu Pro Arg Thr Thr Pro Ser Val Glu 
625 630 635 

gag aac gtg ccg gac ggg ttc gtg ate cec acc gag ctg cac gee ace 1971 
Glu Asn Val Pro Asp Gly Phe Val He Pro Thr Glu Leu His Ala Thr 
640 645 650 

gac cec gac acc acc gcc gag ctg cgc ttc age ate gac tgg gac act 2019 
Asp Pro Asp Thr Thr Ala Glu Leu Arg Phe Ser He Asp Trp Asp Thr 
655 660 665 670 

tec tat gee acc aag cag gga cga gat get gat get gag gag ttt gtt 2067 • 

Ser Tyr Ala Thr Lys Gin Gly Arg Asp Ala Asp Ala Glu Glu Phe Val 
675 680 685 
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aat tgt ata gaa ate gag acg gta tac ccg aac ttg aac gac cga ggc 2115 
Asn Cys lie Glu He Glu Thr Val Tyr Pro Asn Leu Asn Asp Arg Gly 
690 695 700 

acc gcc ate ggc cgc gtg gtg gtt cgc gag ate egg gaa cac gtc ace 2163 
Thr Ala He Gly Arg Val Val Val Arg Glu He Arg Glu His Val Thr 
705 710 715 



ate gac tac gag atg ttc gag gtg ctg tac etc acc gtc agg gtc acg 
He Asp Tyr Glu Met Phe Glu Val Leu Tyr Leu Thr Val Arg Val Thr 
720 725 730 



2211 



gat etc aac acg gtc att gga gac gac tat gat ata tea aca ttc acg 2259 
Asp Leu -Asn Thr Val He Gly Asp Asp Tyr Asp He Ser Thr Phe Thr 
735 740 745 750 

ate att ata ata gac atg aac gac aac ect ccg ctg tgg gtg gaa ggc 2307 
He He He He Asp Met Asn Asp Asn Pro Pro Leu Trp Val Glu Gly - 
755 760 765 

acg ctg acg cag gag ttc cgc gtg cga gag gtc gcc gcc teg gga gtt 2355 
Thr Leu Thr Gin Glu Phe Arg Val Arg Glu Val Ala Ala Ser Gly Val 
770 775 780 

gtt ata gga tec gta etc gcc act gat ate gac gga ect ett tat aat 2403 
Val He Gly Ser Val Leu Ala Thr Asp He Asp Gly Pro Leu Tyr Asn 
785 790 795 

caa gtg egg tat ace ate act ect aga tta gat act eca gaa gac eta 2451 
Gin Val Arg Tyr *Hir He Thr Pro Arg Leu Asp Thr Pro Glu Asp Leu 
800 805 810 

gtg gag ate gac ttc aat teg ggt cag ate tea gtg aag aag cac cag 2499 
Val Glu He Asp Phe Asn Ser Gly Gin He Ser Val Lys Lys His Gin 
815 820 825 830 

« 

gcc ate gac gcg gac gag ccg ccg cgc cag cac etc tac tac ace gtg 2547 
Ala He Asp Ala Asp Glu Pro Pro Arg Glh His Leu Tyr Tyr Thr Val 
835 840 845 

gtc gcc age gac aag tgc gac ctg etc tct gtc gac gtg tgt ccg ect 2595 
Val Ala Ser Asp Lys Cys Asp Leu Leu Ser Val Asp Val Cys Pro Pro 
850 855 860 

gat ect aac tac ttc aac aca ccg gga gac ata acg ate cac ata aca 2643 
Asp Pro Asn Tyr Phe Asn Thr Pro Gly Asp He Thr He His He Thr 
865 870 875 

gac acg aac aac agg gtg ect cga gtg gaa gag gac aag ttc gag gaa 2691 
Asp Thr Asn Asn Arg Val Pro Arg Val Glu Glu Asp Lys Phe Glu Glu 
880 885 890 

att gtc tat ate tac gag ggc gcg gag gac gga gaa cac gtc gtg cag 2739 
He Val Tyr He Tyr Glu Gly Ala Glu Asp Gly Glu His Val Val Glh 
895 -900 905 910 

etc ttc gcc age gat ctg gat aga gat gaa ate tac cac aaa gtg age 2787 
Leu Phe Ala Ser Asp Leu Asp Arg Asp Glu He Tyr His Lys Val Ser 
915 920 925 



tac cag ate aac tac- gcg ate aac ect cgt etc cgc gac ttc ttc gag 



2835 
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Tyr Gin lie Asn Tyx Ala lie Asn Pro Arg Leu Arg Asp Phe Phe Glu 
930 935 940 

gta gac ctg gag acc ggc ctg gtg tac gtc aac aac acg gcc ggg gag - — 2883 
Val Asp Leu Glu Thr Gly Leu Val Tyr Val Asn Asn Thr Ala Gly Glu 
945 950 955 

aag etc gac egg gac ggc gat gaa ccc acg cat egg ate ttc ttc aac 2931 
Lys Leu Asp Arg Asp Gly Asp Glu Pro Tlir His Arg He Phe Phe Asn 
960 965 970 

gtt ate gat aac ttc tat ggg gaa gga gat ggc aac egg aac cag gac 2979 
Val He Asp Asn Phe Tyr Gly Glu Gly Asp Gly Asn Arg Asn Gin Asp 
975 980 985 990 

gag aca caa gtg tta gtg gtg ctg ttg gac ate aac gac aac tat ccg 3027 
Glu Thr Gin Val Leu Val Val Leu Leu Asp He Asn Asp Asn Tyr Pro 
995 1000 1005 

gag ctg ect gag ggt etc tea tgg gat ate tet gag gga ttg eta 3072 
Glu Leu Pro Glu Gly Leu Ser Trp Asp He Ser Glu Gly Leu Leu 
1010 1015 ' 1020 

cag ggt gtc cgt gta acc cca gat ate ttc gcc ccg gac cgc gac 3117 
Gin Gly Val Arg Val Thr Pro Asp He Phe Ala Pro Asp Arg Asp 
1025 1030 1035 

gag ccc ggc acc gac aac tec cgc gtg gcg tac gac ate gtc age 3162 
Glu Pro Gly Thr Asp Asn Ser Arg Val Ala Tyr Asp He Val Ser 
1040 1045 1050 

etc teg ccc acc gac agg gac ate aca ett ect caa etc ttc ace 3207 
Leu Ser Pro Thr Asp Arg Asp He Thr Leu Pro Gin Leu Phe Thr 
1055 1060 1065 

atg ate acc ata gag aag gac agg ggc ate gac cag act gga gag 3252 
Met He Thr He Glu Lys Asp Arg Gly He Asp Gin Thr Gly Glu 
1070 1075 1080 

Ctg gag acc get atg gat tta aga ggc tat tgg ggc act tat gaa 3297 
Leu Glu Thr Ala Met Asp Leu Arg Gly Tyr Trp Gly Thr Tyr Glu 
1085 1090 1095 

ata eat gta aag gca tac gac eat gga gta ect caa agg att tec 3342 
He His Val Lys Ala Tyr Asp His Gly Val Pro Gin Arg He Ser 
1100 1105 1110 

tac gag aag tac ccg eta gtt att aga ect tac aac ttc eac gac 3387 
Tyr, Glu Lys Tyr Pro Leu Val lie Arg Pro Tyr Asn Phe His Asp 
1115 1120 1125 

ect gtg ttc gtg ttc ect caa ect gga atg act ate aga etc gcg 3432 
Pro Val Phe Val Phe Pro Gin Pro Gly Met Thr He Arg Leu Ala . 

1130 1135 1140 

aag gag ega gca gta gtg aac gga gtg ctg gcg aca gtg gac ggc 3477 
Lys Glu Arg Ala Val Val Asn Gly Val Leu Ala Thr Val Asp Gly 
1145 1150 1155 

gag ttc etc gag ega ata gtc gee ace gac gag gat ggc tta. eac . 3522 
Glu Phe Leu Glu Arg He Val Ala Thr Asp Glu Asp Gly Leu His 
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1160 . . . - - 1165 1170 

get gga gtt gtt acc ttc tct ate teg gga gat gat gag gcg ttg 3567 
Ala Gly Val Val Thr Phe Ser lie Ser Gly Asp Asp Glu Ala Leu - ~- 
1175 1180 1185 

cag tac ttc gac gtg ttt aac gae gga gtg aac ttg ggt gcg ctg 3612 
Gin Tyr Phe Asp Val Phe Asn Asp Gly Val Asn Leu Gly Ala Leu 
1190 1195 1200 

acc ate acg cag etc ttc ect gaa gac ttc cga gag ttt cag gtg 3657 
Thr lie Thr Gin Leu Phe Pro Glu Asp Phe Arg Glu Phe Gin Val 
1205 1210 1215 

acg att cgt get aeg gat ggt ggt acg gag cet ggt eca agg agt 3702 
Thr He Arg Ala Thr Asp Gly Gly Thr Glu Pro Gly Pro Arg Ser 
1220 1225 1230 

acc gac tge acg ate acc gta gtg ttt gtg ect acg cag gga gag 3747 
Thr Asp Cys Thr He Thr Val Val Phe Val Pro Thr Gin Gly Glu 
1235 1240 1245 

ect gtg ttc gaa act age acc tac acg gtc get ttt ate gag aaa 3792 
Pro Val Phe Glu Thr Ser Thr Tyr Thr Val Ala Phe He Glu Lys 
1250 1255 1260 

gat get ggt atg gaa gag egg gee aeg ctg ect etc gee aag gae 3837 
Asp Ala Gly Met Glu Glu Arg Ala Thr Leu Pro Leu Ala Lys Asp 
1265 1270 1275 

ccg cgt aac ata atg tgt gaa gat gat tgt cac gac acc tat tac 3882 
Pro Arg Asn He Met Cys Glu Asp Asp Cys His Asp Thr Tyr Tyf 
1280 1285 1290 

age att gtt gga gge aac teg atg ggc cac ttt gcg gtg gac cet 3927 
Ser He Val Gly Gly Asn Ser Met Gly His Phe Ala Val Asp Pro 
1295 1300 1305 

cag tec aac gag ctg ttc ctg ctg acg eca ctg gag cgc gcg gag 3972 
Gin Ser Asn Glu Leu Phe Leu Leu Thr Pro Leu Glu Arg Ala Glu 
1310 1315 ' 1320 

cag gag aeg cac ace etc ate ate ggc gee age gac teg ccc age 4017 
Gin Glu Thr His Thr Leu He He Gly Ala Ser Asp Ser Pro Ser 
1325 1330 1335 

eca gcc gcc gtg ctg cag get tct acc etc act gtt act gtc aat 4062 
Pro Ala Ala Val Leu Gin Ala Ser Thr Leu Thr Val Thr Val Asn 
1340 1345 1350 

gtt cga gaa gca aac ccg egg eca gtg ttc cag age get ctg tac 4107 
Val Arg Glu Ala Asn Pro Arg Pro Val Phe Gin Ser Ala Leu Tyr 
1355 1360 1365 

aca gcc gge ate tee ace etc gae ace ate aac aga ggt ctg eta 4152 
Thr Ala Gly He Ser Thr Leu Asp Thr He Asn Arg Gly Leu Leu 
1370 1375 1380 

acg eta cac gcg act cat tea gaa ggc ttg cet gtg acc tac acg 4197 
Thr Leu His Ala Thr His Ser Glu Gly Leu Pro Val Thr Tyr Thr 
_ .. 1385 - V : 1390 - . 1395--^ - - - 
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ctg gta caa gac tec atg gaa get gac tec aca ctg caa get gtg 4242 
Leu Val Gin Asp Ser Me]t Glu Ala Asp Ser Thr Leu Gin Ala Val 

1400 1405 1410 - - 

cag gag aca gee ttc aac ttg aac cct cag act gga gtg ctg ace 4287 
Gin Glu Thr Ala Phe Asn Leu Asn Pro Gin Thr Gly Val Leu Thr 
1415 1420 1425 . 

etc aac ttc cag ccc aca gea tet atg cac ggc atg ttt gag tte 4332 
Leu Asn Phe Gin Pro Thr Ala Ser Met His Gly Met Phe Glu Phe 
1430 1435 1440 

gat gtg atg get act gat aca gtg gga gaa ace geg cge acc gaa 4377 
Asp Val Met Ala Thr Asp Thr Val Gly Glu Thr Ala Arg Thr Glu 
1445 1450 1455 

gtg aag gtg tae ctg ata tec gac cgc aac aga gtg ttc ttc acg - 442^ 
Val Lys Val Tyr Leu lie Ser Asp Arg Asn Arg Val Phe Phe Thr 
1460 1465 1470 

ttc atg aac acg ctt gaa gaa gtc gaa ccg aat gaa gat ttc ata 4467 
Phe Met Asn Thr Leu Glu Glu Val Glu Pro Asn Glu Asp Phe lie 
1475 1480 1485 

geg gag aca ttt ace ctg ttc ttc ggc atg egg tgc aac ate gac 4512 
Ala Glu Thr Phe Thr Leu Phe Phe Gly Met Arg Cys Asn Tie Asp 
1490 1495 1500 

cag geg ctg ccc gcc age gac ccc gee acc ggc gcc gee agg gac 4557 
Gin Ala Leu Pro Ala Ser Asp Pro Ala Thr Gly Ala Ala Arg Asp 
1505 1510 1515 

gac cag acc gaa gtc agg gca cac ttc ata cgc gac gac ctg cct 4602 
Asp Gin Thr Glu Val Arg Ala His Phe lie Arg Asp Asp Leu Pro 
1520 1525 1530 

gta cct get gag gag ate gaa caa tta "cgc ggc aac ceg acc eta 4647 
Val Pro Ala Glu Glu lie Glu Gin Leu Arg Gly Asn Pro Thr Leu 
1535 1540 1545 

gtg geg acc ate cag aac gcc ctg cag gag gag aac ctg aac ctg 4692 
Val Ala Thr lie Gin Asn Ala Leu Gin Glu Glu Asn Leu Asn Leu 
1550 1555 1560 

gcc gac ctg ttc acg ggc gag act ccc ate ctg ggc ggc gag geg 4737 
Ala Asp Leu Phe Thr Gly Glu Thr Pro He Leu Gly Gly Glu Ala 
1565 1570 1575 

cag geg egg geg gtc tat get etc geg geg gtg geg get geg etc 4782 
Gin Ala Arg Ala Val Tyr Ala Leu Ala Ala Val Ala Ala Ala Leu 
1580 1585 1590 

geg ctg etc tgc gtc gtg ctg ctt ata etc tte tte ate agg act 4827 
Ala Leu Leu Cys Val Val Leu Leu He Leu Phe Phe He Arg Thr 
1595 1600 1605 



agg gee etc aac cgt cgc ctg gaa gee eta tec atg acc aag tae 
Arg . Ala Leu Asn Arg Arg Leu Glu Ala Leu Ser Met Thr Lys Tyr 
i^l<> 1615 1620 



4872 
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agt tec caa gac tea gga etc aac cgc gtg ggt etg gcg gcg ccg 
Ser Ser Gin Asp Ser Gly. Leu Asn Arg Val Gly Leu Ala Ala Pro 
1625 1630 1635 



4917 



ggc ace aac aag eac gcg gtg gag ggc tec aac ecc ate tgg aac 
Gly Thr Ash Lys His Ala Val Glu Gly Ser Asn Pro lie Trp Asn 
1640 ' 1645 1650 



4962 



gaa act ctt aag gca ccg gac ttt gat get ett age gag cag teg 
Glu Thr Leu Lys Ala Pro Asp Phe Asp Ala Leu Ser Glu Gin Ser 
1655 1660 1665 



5007 



tac gac teg ggt ctg ate ggc ate gaa gac ttg ccg cag ttc agg 
Tyr Asp .-Ser Gly Leu lie Gly lie Glu Asp Leu Pro Gin Phe Arg 
1670 1675 1680 

aac gac tac ttc ccg ect gac gag gag age tec atg egg gga gtc 
Asn Asp Tyr Phe Pro Pro Asp Glu Glu Ser Ser Met Arg Gly Val 
1685 1690 1695 

gtc aat gaa eac atg ect gga get aat tea gta gca aac eat aac 
Val Asn Glu His Met Pro Gly Ala Asn Ser Val Ala Asn His Asn 
1700 1705 1710 

aat aac ttc ggg ttc aac get acc ecc ttc age cca gag ttc gcg 
Asn Asn Phe Gly Phe Asn Ala Thr Pro Phe Ser Pro Glu Phe Ala 
1715 1720 1725 

aac teg cag etc agg aga taa aacattatag tattttttat ataatattat 
Asn Ser Gin Leu Arg Arg 
1730 



5052 



5097 



5142 



5187 



5238 



aaagaagtga tataacgcac taaaatttac ctataagtat atattgaagt gtaagatact 5298 
cgtattatgt aagagcatct atttttttac caeeagacaa taaaaacttt ataaaag 5355 

<210> 2<211> 1732<212> PRT<213> . H^liOthis virescens<400> 2 

Met Taa Val Asp Val Arg lie Leu Thr Ala Ala Val Leu Xle Leu Ala 
1 5 Id * 15 

Ala His Leu Thr Val Ala Gin Asp Cys Ser Tyr Met Val Ala lie Pro 
20 25 30 

Arg Pro Glu Arg Pro Asp Phe Pro Asn Gin Asn Phe Glu Gly Val Pro 
35 40 45 

Trp Ser Gin Asn Pro Leu Leu Pro Ala Glu Asp Arg Glu Asp Val Cys 
50 55 60 

Met Asn Ala Phe Asp Pro Ser Ala Leu Asn Pro Val Thr Val lie Phe 
65 70 75 80 



Met Glu Glu Glu He Glu Gly Asp Val T^a He Ala Arg Leu Asn Tyr 
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Arg Gly Thr Asn Thr Pro Thr Val Val Thr Pro Phe Asn Phe Gly Thr 
100 105 110 - - 



Phe His lieu Leu Gly Pro Val lie Arg Arg lie Pro Glu Gin Gly Gly 
115 120 125 



Asp Trp His Leu Val lie Thr Gin Arg Gin Asp Tyr Glu Thr Pro Asn 
130 135 140 



Met Gin Gin Tyr lie Phe Asn Val Arg Val Glu Asp Glu Pro Gin Glu 
145 150 155 160 



Ala Thr Val Met Leu lie He Val Asn He Asp Asp A^n Ala Pro He 
165 170 175 



He Gin Met Phe Glu Pro Cys Asp He Pro Glu His Gly Glu Thr Gly 
180 185 190 



Thr Thr Glu Cys Lys Tyr Val Val Ser Asp Ala Asp Gly Glu He Ser 
195 200 205 



Thr Arg Phe Met Thr Phe Gin He Glu Ser Asp Arg Asn Asp Glu Glu 
210 215 220 



Tyr Phe Glu Leu Val Arg Glu Asn He Gin Gly Gin Trp Met Tyr Val 
225 230 235 240 



His Met Arg Leu He Leu Asn Lys Pro Leu Asp Tyr Glu Glu Asn Pro 
245 250 255 



Leu His Leu Phe Arg Val Thr Ala Leu Asp Ser Leu Pro Asn Val His 
260 265 270 



Thr Val Thr Met Met Val Gin Val Glu Asn He Glu Ser Arg Pro Pro 
275 280 285 



Arg Trp Met Glu He Phe Ala Val Gin Gin Phe Asp Glu Lys Thr Ala 
290 295. 300 



Gin Ala Phe Arg Val Arg Ala He Asp Gly Asp Thr Gly He Asp Lys 

310 315 320 



Pro He Phe Tyr Arg He Glu Thr Glu Glu Ser Glu Lys Asp Leu Phe 
325 330 335 
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Ser Val Glu Thr lie Gly Ala Gly Arg Glu Gly Ala Trp Phe Lys Val 
340 345 350 

Ala Pro lie Asp Arg.Asp Thr Leu Glu Lys Glu Val Phe His Val Ser 
355 360. 365 

Leu lie Ala Tyr Lys Tyr Gly Asp Asn Asp Val Glu Gly Ser Pro Ser 
370 375 380 

Phe Glu ,Ser Lys Thr Asp lie Val lie lie Val Asn Asp Val Asn Asp 

390 395 400 

Gin Ala Pro Val Pro Phe Arg Pro Ser Tyr Tyr lie Glu lie Met Glu - 
405 410 415 

Glu Ala Ala Met Thr Leu Asn Leu Glu Asp Phe Gly Phe His Asp Arg 
420 425 430 

Gly Leu Gly Pro His Ala Gin Tyr Thr Val His Leu Glu Ser lie Ser 
435 440 445 

Pro Ala Gly Ala His Glu Ala Phe Tyr lie Ala Pro Glu Val Gly Tyr 
450 455 460 

Gin Arg Gin Ser Phe He Val Gly Thr Gin Asn His His Met Leu Asp 

470 475 480 

Phe Glu Val Pro Glu Phe Gin Lys He Gin Leu Arg Ala Val Ala He 
485 490 495 

Asp Met Asp Asp Pro Arg Trp Val Gly He Ala He He Asn He Asn 
500 505 510 

Leu He Asn Trp Asn Asp Glu Leu Pro He Phe Glu His Asp Val Gin 
515 520 525 

Thr Val Thr Phe Lys Glu Thr Glu Gly Ala Gly Phe Arg Val Ala Thr 
530 535 540 

Val Leu Ala Lys Asp Arg Asp He Asp Asp Arg Val Glu His Ser Leu 
545 550 555 550 

Met Gly Asn Ala Val Asn Tyr Leu Ser He Asp Lys Asp Thr Gly Asp 
565 570 575 
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lie Leu Val Thr lie Asp Asp Ala Phe Asn Tyr Kis Arg Gin Asn Glu 
580 585 590 

lieu Phe Val Gin lie Arg Ala Asp Asp Thr Leu Gly Glu Pro Tyr Asn 
595 600 605 

Tlir Asn Thr Ala Gin Leu Val lie Gin Leu Gin Asp lie Asn Asn Thr 
610 615 620 

Pro Pro Thr Leii Arg Leu Pro Arg Thr Thr Pro Ser Val Glu 6lu Asn 
^25 630 635 640 

Val Pro Asp Gly Phe Val lie Pro Thr Glu Leu His Ala Thr Asp Pro 
645 650 655 

Asp Thr Thr Ala Glu Leu Arg Phe Ser lie Asp Trp Asp Thr Ser Tyr 
660 665 '670 

Ala Thr Lys Gin Gly Arg Asp Ala Asp Ala Glu Glu Phe Val Asn Cys 
675 680 685 

lie Glu lie Glu Thr Val Tyr Pro Asn Leu Asn Asp Arg Gly Thr Ala 
690 695 700 

He Gly Arg Val Val Val Arg Glu He Arg Glu His Val Thr He Asp 
705 710 715 720 

Tyr Glu Met Phe Glu Val Leu Tyr Leu Thr Val Arg Val Thr Asp Leu 
725 -> 730 735 

Asn Thr Val He Gly Asp Asp Tyr Asp He Ser Thr Phe Thr He He 
740 745 750 

He He Asp Met Asn Asp Asn Pro Pro Leu Trp Val Glu Gly Thr Leu 
755 760 765 

Thr Gin Glu Phe Arg Val Arg Glu Val Ala Ala Ser Gly Val Val He 
770 N 775 780 

Gly Ser Val Leu Ala Thr Asp lie Asp Gly Pro Leu Tyir Asn Gin Val 
785 790 795 800 

Arg Tyr Thr He Thr Pro Arg Leu Asp Thr Pro Glu Asp Leu Val Glu 
805 810 815 

He Asp Phe Asn Ser Gly Gin He Ser Val Lys Lys His Gin Ala He 
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Asp Ala Asp Glu Pro Pro Arg Gin His Leu Tyr Tyr Thr Val Val Ala- 
835 840 .845 



Ser Asp Lys Cys Asp Leu Leu Ser Val Asp Val Cys Pro Pro Asp Pro 
. 850 855 860 



Asn Tyr Plie Asn Thr Pro Gly Asp lie Tlir lie His lie Thr Asp Thr 
865 870 875 880 



Asn Asn Arg Val Pro Arg Val Glu Glu Asp Lys Phe Glu Glu lie Val 
885 890 895 



Tyr lie Tyr Glu Gly Ala Glu Asp Gly Glu His Val Val Gin Leu Phe 
900 905 910 



Ala Ser Asp Leu Asp Arg Asp Glu lie Tyr His Lys Val Ser Tyr Gin 
915 920 925 



lie Asn Tyr Ala lie Asn Pro Arg Leu Arg Asp Phe Phe Glu Val Asp 
930 935 940 



Leu Glu Thr Gly Leu Val Tyr Val Asn Asn Thr Ala Gly Glu Lys Leu 
945 950 955 960 



Asp Arg Asp Gly Asp Glu Pro Thr His Arg lie Phe Phe Asn Val lie 
965 970 975 

Asp Asn Phe Tyr Gly Glu Gly ABp Gly Asn Arg Asn Gin Asp Glu Thr 
980 985 990 

Gin Val Leu Val Val Leu Leu Asp lie Asn Asp Asn Tyr Pro Glu lieu 
995 1000 1005 

Pro Glu Gly Leu Ser Trp Asp lie Ser Glu Gly Leu Leu Gin Gly 
1010 1015 1020 



Val Arg Val Thr Pro Asp lie Phe Ala Pro Asp Arg Asp Glu Pro 
1025 1030 1035 



Gly Thr Asp Asn Ser Arg Val Ala Tyr Asp He Val Ser Leu Ser 
1040 1045 1050 



Pro Thr Asp Arg Asp He Thr Leu Pro Gin Leu Phe Thr Met He 
1055 - ^060 . - - 1065 - 
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Thr lie Giu Lys Asp Arg Gly He Asp Gin Thr Gly Glu Leu Glu 
1070 1075 1080 



Thr Ala Met Asp I.eu Arg Gly Tyr Trp Gly Thr Tyr Glu He His 
1085 1090 1095 



Val Lys Ala Tyr Asp His Gly Val Pro Gin Arg He Ser Tyr Glu 
1100 1105 1110 



Lys Tyr Pro Leu Val He Arg Pro Tyr Asn Phe His Asp Pro Val 
1115 1120 1125 



Phe Val Phe Pro Gin Pro Gly Met Thr He Arg Leu Ala Lys Glu 
1130 1135 1140 



Arg Ala Val Val Asn Gly Val Leu Ala Thr Val Asp Gly Glu Phe 
1145 1150 1155 



Leu Glu Arg He Val Ala Thr Asp Glu Asp Gly Leu His- Ala Gly 
1160 1165 • 1170 



Val Val Thr Phe Ser He Ser Gly Asp Asp Glu Ala Leu Gin Tyr 
1175 1180 1185 



Phe Asp Val Phe Asn Asp Gly Val Asn Leu Gly Ala Leu Thr He 
1190 1195 1200 



Thr Gin Leu Phe Pro Glu Asp Phe Arg Glu Phe Gin Val Thr He 
1205 1210 1215 



Arg Ala Thr Asp Gly Gly Thr Glu Pro Gly Pro Arg Ser Thr Asp 
1220 1225 1230 



Cys Thr He Thr Val Val Phe Val Pro Thr Gin Gly Glu Pro Val 
1235 1240 1245 



Phe Glu Thr Ser Thr Tyr Thr Val Ala Phe He Glu Lys Asp Ala 
1250 1255 1260 



Gly Met Glu Glu Arg Ala Thr Leu Pro Leu Ala Lys T^p Pro Arg 
1265 1270 1275 



Asn He Met Cys Glu Asp Asp Cys His Asp Thr Tyr Tyr Ser lie 
1280 1285 1290 
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Val Gly Gly Asn Ser Met Gly His Phe .Ala Val Asp Pro Gin Sear 
1295 1300 1305 



Asn Glu lieu Phe Leu Leu Thr Pro Leu Glu Arg Ala Glu Gin Glu 
1310 1315 1320 



Thr His Thr Leu lie lie Gly Ala Ser Asp Ser Pro Ser Pro Ala 
1325 1330 1335 



Ala Val - Leu Gin Ala Ser Thr Leu Thr Val Thr Val Asn Val Arg 
1340 1345 1350 



Glu Ala Asn Pro Arg Pro Val Phe Gin Ser Ala Leu Tyr Thr Ala 
1355 1360 1365 



Gly lie Ser Thr Leu Asp Thr lie Asn Arg Gly Leu LeU Thr Leu 
1370 1375 1380 



His Ala Thr His Ser Glu Gly Leu Pro Val Thr Tyr Thr Leu Val 
1385 1390 1395 ' 



Gin Asp Ser Met Glu Ala Asp Ser Thr Leu Gin Ala Val Gin Glu 
1400 . 1405 1410 



Thr Ala Phe Asn Leu Asn Pro Gin Thr Gly Val Leu Thr Leu Asn 
1415 1420 1425 



Phe Gin Pro Thr Ala Ser Met His Oly Met Phe Glu Phe Asp Val 
1430 1435 1440 



Met Ala Thr Asp Thr Val Gly Glu Thr Ala Arg Thr Glu Val Lys 
1445 1450 1455 



Val Tyr Leu lie Ser Asp Arg Asn Arg Val Phe Phe Thr Phe Met 
1460 1465 1470 



Asn Thr Ijeu Glu Glu Val Glu Pro Asn Glu Asp Phe lie Ala Glu 
1475 1480 1485 



Thr Phe Thr Leu Phe Phe Gly Met Arg Cys Asn He Asp Gin Ala 
1490 1495 1500 



Leu Pro Ala Ser Asp Pro Ala Thr Gly Ala Ala Arg Asp Asp Gin 
1505 1510 1515 
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Thr Glu Val Arg Ala His Phe lie Arg Asp Asp lieu Pro Val Pro 
1520 1525 1530 

Ala Glu Glu lie Glu Gin I*eu Arg Gly Asn Pro Thr Leu Val Ala 
1535 1540 1545 

Tbr He Gin Asn Ala Leu Gin Glu Glu Asn Leu Asn Leu Ala Asp 
1550 1555 1560 

Leu Phe Thr Gly Glu Thr Pro . He Leu Gly Gly Glu Ala Gin Ala 
1565 1570 1575 

Arg Ala Val Tyr Ala Leu Ala Ala Val Ala Ala Ala Leu Ala Leu 
3-580 1585 1590 

Leu Cys Val Val Leu Leu He Leu Phe Phe , He Arg . Thr Arg Ala 
1595 1600 1605 

Leu Asn Arg Arg Leu Glu Ala Leu Ser Met Thr Lys Tyr Ser Ser 
1610 1615 1620 

Gin Asp Ser Gly Leu Asn Arg Val Gly Leu Ala Ala Pro Gly Thr 
1625 1630 1635 

Asn Lys His Ala Val Glu Gly Ser Asn Pro He Trp Asn Glu Thr 
1640 1645 1650 



Leu Lys Ala Pro Asp Phe Asp Ala Leu Ser Glu Gin Ser Tyr Asp 
1655 1660 1665 

Ser Gly Leu He Gly He Glu Asp Leu Pro Gin Phe. Arg Asn Asp 
1670 1675 1680 



Tyr Phe Pro Pro Asp Glu Glu Ser Ser Met Arg Gly Val Val Asn 
1685 1690 1695 

Glu His Met Pro Gly Ala Asn Ser Val Ala Asn His Asn Asn Asn 
1700 1705 1710 

Phe Gly Phe Asn Ala Thr Pro Phe Ser Pro Glu Phe Ala Asn Ser 
1715 1720 1725 

Gin Leu Arg Arg 
1730 
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<210> 3<211> 7799<212> DNA<213> Heliothis virescens<220><221> 
prinier_biiid<222> (1982) (2001) <223> primer Fl 

<220><221> primer_bind<222> (4322) (4351) <223> primer R2- - — 
<220><221> primer_bind<222> (2029) (2052) <223> primer R3 



gatcgactag 


GenBank AF367363<309> 2001-08-12<313> (1) 
tgaattgttg trttcaaaaaa gcaagaaatt acgtagttct 


' • • \ # 1 ^TtW 

aaaaat:aaaa 


> 3 
60 


gtgtttttgt 


gtaaagtgta 


gtgctaaaga 


acttggcatt 


gtttgaagag 


gattaaaagg 


120 


agtccaaatt 


gatcacacgc 


aggtggatct 


accaggctiag 


agacacgaga 


tacgacagag 


180 


tgagaactat 


gagatggcag 


tcgacgtgag 


aatactgacg 


gcagcggtat 


tgattctcgc 


240 


tgctaattta 


actfctcgcgc 


aagattgttc 


ctatatggta 


gcaattccca 


gaccagagcg 


300 


accagacttt 


cctaatcaaa 


atttcgaagg 


agtaccatigg 


agtcagaacc 


ctctattacc 


360 


agcggaggat 


agggaagatg 


tgtgcatgaa 


cgcgtttgat 


ccaagtgcct 


tgaaccGcgt 


420 


caccgtcatc 


tt:catgQagg 

^ ^^^^ ^ 




aggggacgt:g 


gccattgcca 


ggcbtaacta 


480 


ccgaggtacc 


aatiactccga 


ccgtggtaac 


tccatttaac 


tttggtacct 


tccacttgtt 


540 


ggggccggtc 


atacgtagga 


tccccgagca 


agggggggac 


tggcatcttg 


ttattacgca 


600 


gaggcaggac 


t:at:gagaccc 


cgaacatgca 


gcagtatatc 


ttcaacgtga 


gggtragaaga 


660 


tgagccccag 


gaagccactg 


tgatgctcat 


cattgtcaac 


atcgacgaca 


acgctcctat 


720 


gat:acagat:g 


ttcgagcctt 


gtgacatticc 


tgaacacggc 


gaaacgggca 


ccacagaatg 


780 


caagtccgtg 


gtgagcgatg 


ctgacggcga 


gatcagcaca 


cgtttcatga 


cgttcgaaat 


840 


cgagagcgat 


cgaaacgacg 


aagaatiatitt 


cgaacttgtc 


agagagaat:a 


ttcagggaca 


900- 


gtggatgtac 


gtccatatga 


ggctcatcct 


caacaaacct: 


ctggactatg 


aggaaaaccc 


960 


gctgcatttg 


tttagagtta 


cagctttgga 


ttccctaccg ' aacgttcata 


cagtigactat 


1020 


gatggtgcaa 


gtcgagaaca 


tagagagcag 


accgccgcgg 


tggatggaga 


tcttcgccgt 


1080 


gcagcagttc 


gatgagaaga 


cagcacaatc 


cttcagggtt 


cgagccatcg 


atggagacac 


1140 


gggaatcgat 


aaacctattt 


tctataggat 


cgaaactgaa 


gaaagcgaga 


aagatctgtt 


1200 


cagtgttgaa 


acaatagggg 


ctggtcggga 


aggtgcttgg 


tttaaagtcg 


ctccaataga 


1260 


cagagacacc 


cttgaaaagg 


aagttttcca 


cgtgtctcta 


attgcgtaca 


aatatggcga 


1320 


taatgacgtg 


gaaggaagtc 


cgtcatttga 


gtcgaaaacc 


gatatcgtca 


t:ca1::tgtgaa 


1380 


cgacgtgaat 


gatcaggcgc 


cggtgcettt 


ccgtccgtca 


tacttcattg 


aaatcatgga 


1440 


ggaaact:gcg 


atgacactta 


atttggaaga 


ctttggtttc 


cacgatagag 


atcttggtcc* 


1500 


gcacgcgcag 


tacacagtgc 


acctggagag 


catccacccg 


gcgggagcgc 


acgaggcgtt 


1560 


ctacatcgcg 


ccggaggtgg 


gctaccagcg 


acagtccttc 


atcgtcggca 


cgcagaacca 


1620 
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tcacatgctc 


gacttcgaag 


tgccagagtt 


ccagaagat:a 


caacttaggg 


tggtagccat 


1680 


agacatggac 


gatccgaggt 


gggttggtat 


cgcgattata 


aacattaacc 


tgatcaacrtg" 


1740 


gaacgatgag 


ctgccgatct 


tcgagcacga 


tgtgcagact 


gtcaccttca 


aggagacgga 


1800 


gggcgctggc 


ttccgtgtcg 


ccactgtact 


ggctaaggac 


cgggatattg 


atgatiagagt 


1860 


cgaacactct 


ctaatgggca 


acgcagtcaa 


ctacctgagt 


atcgacaaag 


acaccggcga 


1920 


catcctcgtg 


acaatcgacg 


atgcat:tcaa 


ctatcacaga 


caaaacgagc 


tctttgtgca 


1980 


gatacgagct 


gacgacacgc 


tgggagagcc 


gtacaacacc 


aacactgcct 


gttccggact 


2040 


gtcacatcgc 


gccggcctat 


gaggtcgcgc 


cagcacacgt 


catcgtgcgc 


cccacctaag 


2100 


ctgggccctc 


accatacgcc 


ggacccccgg 


acactcgcfcc 


agcgaccccg 


gtcgcgcat^a 


2160 


cacgaccgca 


cgggcaacgc 


gcgattttct 


ct:tgt:acata 


cttcaatzaca 


gtcttctttg 


2220 


caaatcgaag 


tttcatfcgaa 


ccgccgagac 


catcatccta catctggac^c 


tcggcgctca 


2280 


agcattggtc 


cctcgcaaga 


actgaaccga 


acaggagaag 


tcgtcttgca 


agctgcgcaa 


2340 


ccgcccgtgg 


actttggact 


ggtcacccgc 


cfcctgcgact 


tacgcgtcgt 


categcatcg 


2400 


tggaggacta 


gtggggacgt 


gtctacccgg 


aaggaaggaa 


catactgag^ 


gagtccgttc 


2460 


cgatatccct 


ttgaattcct 


gtcgttattc 


ccttgtatcc 


aggaatccag 


gaattiliaaag 


2520 


ggcttcttac 


attgatcagg 


catattgatc 


tcttcataca 


tcttatttct 


fcacgcttacg 


2580 


tactaaattc 


ataaatctat 


aaacttttgc 


acataagtt:c gttatctttt 


aaacagtaga 


2640 


ttctttgttc 


atttgcatta 


cgtaaacaag 


ctcattagcg 


agcggcttag 


cgaaccggtt 


2700 


tctaaacccg 


gtcgttcgct 


tcgaatcagc 


tgattcattg tcacttgccfc 


at:t:t:aaacat: 


2760 


tttatttcat 


atttaacgtt 


tttttctttg 


tttgtttatt 


ttattttatt 


ttcattcttc 


2820 


cacgtttgct 


ttcttgctta 


aagtatggct 


gatgcaacgt 


tggaggaact 


tcttagaggg 


2880 


cgtggtaatt 


tcgaagccga 


gcttggacgt 


ttcgataact 


ttttaaattic 


gt:taacaagt 


2940 


ccatatttag 


acagtgctca 


tatggctaca 


ttgcagtctc 


gaattagttt: 


tatat:t:ggca 


3000 


ctttatgatg 


acttcgacat 


gtttcagacg 


gagttggagg 


tgttgtcgga 


gaacgtcgac 


3060 


gacatgttgg 


tcgagcggga 


gcgtatcgag 


tcgcggtact 


tcgcactggt 


ggcgcgtgcg 


3120 


aagttcaccc 


t:taaacacgg 


gcgtgcgtca 


tcacctgcgg 


tcttgcctgc 


acttcctgtt 


3180 


gaagacgtcc 


ggcagcacgg 


ctctgactgc 


gggcacggcc 


tggttcagct 


gccagagatg 


3240 


gacttgagct 


gcataccttc 


tgagaccagt 


accgggalitic 


ttgcatctcg 


gggggtgaag 


3300 


gctgattcca 


tcagctcgtc 


cactctttgg 


cggtcaggtt 


gbacacagta 


cacaaccacg 


3360 


tcatcaaatt 


cagtacatac 


aacatctatt 


acgtatatta 


at:aat:gaat:t: 


ggcacacaac 


3420 
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3a.ca.a,caa.t: t 


ccat:ttcat:t. 


aataaaccaa 


aaaacatctig aaccacat^gc 


aaaaticatica 


3480 


ca.t:aacctca. 


tatitacagac 


cbtctca^ca 


cacataatat atcaclicaaa 


tacatactztt 


3540 


acgatgccat 


gcatzcaaaga 


tiatiaccattig 


cctttcratac attaactaca 


t" "h" a fa ccraac 


3600 


aaactacgtt 


acacattgtc 


aaQccftttta 


tactctacaa cccfaacrcccrt 


oic 1 1 r* t" cxcfacf 


3660 


ctciQtacccQ 


tgtttcttgc 


gcctitcagcc 


a.gttatccgg actigacgcgc 


crtacfcccaae 


3720 


aacacictoct 


tcttttttct 


ttgcattttc 


cttttttcac ccttttcatg 


a tcracra't cac 


3780 


1 1 ticrc t atcici 


tcacrcrti'kacia 


ocr c t to c 1 1 a 


aafiticca'tac: cttctccetc 


t ac r*f a t" t" o 


3840 


acclzctzctca 


tcttttccca 


gacacatcca 


tiatcocfaccici ccactcatcrt 




3900 


ccggcagatc 


acfcaacccac 


ctctfcctgfcg 


actciticacca crtaactacta 


cctccat'^^aa 


3960 


cgcac^accaQ 


agagg eg t c a 


ccacaccatic 


atcatcafcca ■h'cttcraacrcc! 


t "fr* aa f*a cf cif'h 
w t»v^ctwdy ^ w L> 


4020 


'tcaacccggcf 


gaatzatigtitic 


ccioacticf'tca 


cs.1r.ccicac*c*ci crf^ct^ataacrcx 


^ c*ac*n r* c* Side* 




acaccrtcatc 


gtzgcgcccca 


c c tiaacr ct aa 


accetcaoca ■l^aGQfr*Gfcrar» 


w w w wy y OL w ci V* 


4 1 AO 


tcgctcatcg 


accccggtcg 


ccrcatacacor 




i~ i" a rf^ fr" r" i" i" cr 


A900 
^ ^ \i u 


ticacctia'bcti 


ataatiacagti 


cttctacttt 


aaar»at'<^craa at"t't~4~ai"'t*cra 


ct d v« y ^ y ci y ct 












UdCa.gC.l^y v^d 


A '^0 fl 
U 




aa c c ac c t r: 


cfcacof't" oaci 




iM>ciy L.y y ddy d 








"t oa t f*or»oa f 




Im> ^y dCrdO v»ctO 






car*ttGacfca 




wc^v* t.» i.»v*v.> lectin* y v^t.»<tw.v*ctcty %■> 


aggga 




tactaafecrct 


a acf oaa 1 1 fccr 


tfcaattcitat 


acraaa t* orrarr a#^dfTrHa"f*ar*r« 
«>jc».aicii- wy dy civ*^y cct t^^ciwv- 


wy ctdw u>y del 


A t>^0 


cgac cgaggc 


accgcca't^cg 


ciccaccitcicrt 


yy t» ^-wy wy ciy ci.L.v^v^yyy h-clv^ 


a f^t^ H <^ a ^ ^ a 
ct*-*y i„v^ct^ L^ctu 




aciactiacaacf 




fccictotacct 


€^aoattt*<**artfi Vf ♦* t^nns* ♦* r« 
woLway Ayy y i,^wciwyyciii>\.« 


^ a a ^ a e^f^rx \' 
LvWdcLwavry y w 


A^AO 
^ 0 0 u 


. cattcTGracfac 


gactza^gatza 


ti atcaaca ti ti 


cacaatcata a^aaiiacfaca 


"1" CTa a orra r*a a 
t«y ddWry d v«dd 


A7AO 


ccc^ccgct^ 




cf cactctaac 


y «-*«y y ^y *— ^ *^y ***y **y **y 


dyy L.w*y^w>y ^ 


A O no 




atit^atiaggatz 


ccgt^actcgc 


taccciafcatc craccicracctc? 


ffefcaf'aa t*<^a 




acrlicrccrcrlia'b. 


acca'tcactic 


ctaaactacra 


cactccacfaa cfar^r*'t"aat*crcy 


dy d !.« wy d V* w 


4920 


caattcg9[gtz 


cagatct^cag 


tioaaaaaaca 


ccaoacfcafcr* cracar'CfCfar'cr 


dy u« <«»y w «^ y wy 


4980 


cca^cacc^c 


tacbacaccg 


*-yy wVi»y k^^qvj 


C!Ciac!!aacit~crr* aar^f*^cir»i"r«t* 


o rrf" a # 'fit' 
w t»y i»v«y dv«y w 


CAAO 


gtgtccgcct 


gatcctaact 


acttcaacac 


gcccggagag ataacgaticc 


acat:aacaga 


5100 


cacgaacaat: 


aaggtgcctc 


gagtggaaga 


ggacaagttc gaggaaactg 


tctatzatcta 


5160 


cgagggcgcg 


gaggacggag 


aacacgtzcgt: 


gcaactcttc gccagcgatc 


tggatragaga 


5220 


tgaaatctac 


cacaaagtga 


gctaccagat 


caactacgcg atcaaccctc 


gtctccgcga- 


5280 - 
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cttcttcgag gtagacttgg agaccggcct ggtgtacgtc aacaacactg ccggggagaa 534 0 
actcgacaga gacggcgatg aacccacgca tcggatcttc ttcaacgtca tcgataacttr"- 5400 

ctatggagaa ggagatggca accggaatca ggacgagaca caagtgttgg tggtgctgtt 5460 

ggacatcaac gacaactacc cggaactgcc tgagggtctc tcatgggata tctctgaggg 5520 

cttgctacag ggtgtccgtg taaccccaga tatcttcgcc ccggaccgcg acgagcccgg 5580 

caccgacaac tcccgcgtgg cgtacgacat cgtcagcctc acgcccaccg acagggacat 5640 

cacacttcet caactcttca ccatgatcac catagagaag gacaggggca tcgaccagac 5700 

tggagagctg gagaccgcta tggatttaag aggctattgg ggcacttatg aagtacatgt 5760 

caaggcatac gaccatggag ttcctcaaag gatatcctac gagaagtacc cgctagttat 5820 

aagaccttac aacctccacg accctgtgtt cgtgttccct caacctggaa tgactatcag 5880 

actcgcggag gagcgagcag tagtgaacgg cgtgctggcg acagtggacg gcgagttcct 5940 

cgagcgaatc gtcgccaccg acgaggatgg cttacacgct ggagttgtta ccttctctat 6000 

ctctggagat gaggaggcgt tgcagtactt cgacgtgttt aacgacggag tgaacttagg 6060 

tgcgctgacc atcacgcagc tcttccctga agacttccga gagtttcagg tgacgattcg 6120 

tgctacggat ggtggtacgg agccaggtcc aagaagtacg gactgcacgg tcaccgtagt 6180 

gtttgttcct acgcagggag agcctgtgtt cgagactagc acctacacgg tcgcttttat 6240 

tgagaaagac gctggtatgg aagagcgggc cacgctgcct ctcgccaagg acccgcgtaa 6300 

cataatgtgt gaagatgatt gtcacgacac ttattacagc attgttggag gcaactcgat 6360 

gggccacttc gcagtagacc cccagtccaa cgagctgttc ctgctgacgc cgctggagcg 6420 

cgcggagcag gagacgcaca ccctcatcat cggcgccagc gactcgccca gcccggccgc 6480 

cgtgctgcag gcttccaccc tcactgttac tgtcaatgtt cgagaagcaa acccgcggcc 6540 

agtgttccag agcgctctgt acacagccgg catctccacc ctcgacacca tcaacagagc 6600 

tctgctgacc ttacacgcga cccattcaga aggcctgccc gtgacctaca cgctgataca 6660 

agactctatg gaagctgact ccacactgca agctgtgcag gagacagcct tcaacctgaa 6720 

ccctcagact ggagtgctga ccctcaactt ccagcccaca gcatctatgc acggcatgtt 6780 

tgagttcgat gtgatggcta ctgatacagt gggagagacc gcgcgcactg aagtgaaggt 6840 

gtacctgata tccgaccgca acagagtgtt cttcacgttc atgaacacgc tcgaagaagt 6900 

cgaaccgaat gaggatttca tagcggaaac atttaccctg tttttcggca tgcggtgcaa 6960 

catcgaccag acgctgcccg ccagcgaccc cgccaccggc gccgccaggg acgaccagac 7020 

cgaagtcagg gcacacttca tacgcgacga cctgcctgta ccggctgagg agattgaaca 7080 
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gttgcgcggt aatccaacac tagtggcgac aatccagaac gccctgcagg aggagaacct 7140 

gaacctagcc gacctgttca cgggcgagac tcccatcctg ggcggcgagg cgcaggcgcg 7200 

ggcggtgtac gcgctggcgg cggtggcggc cgcgctcgcg ctgctctgtg tcgtactgct 7260 

tatactcttc ttcatcagga ctagggccct caaccgtcgt ctggaagctc tctccatgac 7320 

caagtacagt tcccaagact cgggtctgaa: ccgcgtgggt ctggcggcgc cgggcaccaa 7380 

caagcacgcg gtggagggct ccaacccaat ctggaacgaa accctcaagg caccggactt 7440 

tgatgctctt agcgagcagt cgtacgactc ggatctgatc ggcatcgaag acttgccgca 7500 

gttcaggaac gactacttcc cgcctgacga ggagagctcc atgcggggag tcgtcaatga 7560 

acacatgcct ggagctaatt cagtagcaaa ccataacaat aacttcgggt tcaacgctac 7620 

ccccttcagc ccagagttcg cgaactcgca gctcagaaga taaaatatta tagtattttt 7680 

ttatataata ttatgtaaaa gtgatataac gcacactaaa atttacctat aagtatatat 7740 

tgaagtgtaa gatactcgta ttatgtaaga gcatttattt ttttactacc agacagtaa 7799 

<210> 4<211> 2263<212> DNA<213> Heliothis virescens<400> 4 
tgttccggac tgtcacatcg cgccggccta tgaggtcgcg ccagcacacg' tcatcgtgcg 60 

ccccacctaa gctgggccct caccatacgc cggacccccg gacactcgct cagcgacccc 120 

ggtcgcgcat; acacgaccgc acgggcaacg cgcgatfcttc tcttgtacat acttcaatac 180 

agtcttcttt gcaaat:cgaa gtttcattga accgccgaga ccatcatcct acatctggac 240 

ctcggcgctc aagcattggt ccctcgcaag aactgaaccg aacaggagaa gtcgtcttgc 300 

aagctgcgca accgcccgtg gactttggac tggtcacccg cctctgcgac ttacgcgtcg 360 

tcatcgcatc gtggaggact agtggggacg tgtctacccg gaaggaagga acatactgag 420 

tgagtccgtt ccgatatccc tttgaattcc tgtcgttatt e'ccttgtatc caggaatcca 480 

ggaatttaaa gggcttctta cattgatcag gcatattgat ctcttcatac atcttatttc 540 

ttacgcttac gtactaaatt cataaatcta taaacttttg cacataagtt cgttatcttt 600 

taaacagtag attctttgtt catttgcatt acgtaaacaa gctcattagc gagcggctta 660 

gcgaaccggt ttctaaaccc ggtcgttcgc ttcgaatcag ctgattcatt gtcacttgcc 720 

tatttaaaca ttttatttca tatttaacgt ttttttcttt gtttgtttat tttattttat 780 

tttcattctt ccacgtttgc tttcttgctt aaagtatggc tgatgcaacg ttggaggaac 840 

ttcttagagg gcgtggtaat ttcgaaigrccg agcttggacg tttcgataac tttttaaatt 900 

cgt:t:aacaag tccafcattta gacagtgctc atatggctac attgcagtct cgaattagtt 960 

ttatattggc actttatgat gacttcgaca tgtttcagac ggagttggag gtgttgtcgg 1020 

agaacgtcga cgacatgttg gtcgagcggg agcgtatcga gtcgcggtac ttcgcactgg " 1080 " 
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tggcgcgtgc gaagttcacc cttaaacacg ggcgtgcgtc atcacctgcg gtcttgcctg 1140 
cacttcctgt tgaagacgtc cggcagcacg gctctgactg cgggcacggc ctggtt<:age-'' 1200 
tgccagagat ggacttgagc tgcatacctt ctgagaccag taccgggatt cttgcatctc 1260 

ggggggtgaa ggctgattcc atcagctcgt ccactctttg gcggtcaggt tgtacacagt 1320 

acacaaccac gtcatcaaat tcagtacata. caacatctat tacgtatatt aataatgaat 1380 

tggcacacaia caacaacaat tccatttcat taataaacca aaaaacatct gaaccacatg 1440 

cagaatca^c acataacctc atattacaga ccttctcatc acacataata tatcactcaa 1500 

atacatactt tacgatgcca tgcatcaaag atataccatt gcctttgata cattggctgc 1560 

attacacgag caaactacgt tacacattgt caagcgtttt gtgctctgca gccggagccg 1620 

tgcttctgga gctggtaccc gtgtttcttg cgccttcagc cagttatccg gactgacgcg 1680 

cgtactccga caacagctgc ttcttttttc tttgcatttt cctttttt'ea cccttttcat 1740 

gatgagatca ctttgctatg gtcaggttag aggcttgctt aaattccata ccttctccct 1800 

ctacccatct gacctctctc atcttttccc agacacatcc atatcggacg gccactcatg 1860 

tgtacctttc accggcagat cactaaccca cctcttctgt ggctgtcacc agtgactact 1920 

gcctccatca acgcagacca gagaggcgtc accacaccat catcatcatc atcttgaagc 1980 

cttgacaggc ttcaacccgg ggaatatgtt ccggactgtc acatcgcgcc ggcctatgag 2040 

gtcgcgccag cacacgtcat cgtgcgcccc acctaagctg ggccctcacc atacgccgga 2100 

cccccggaca ctcgctcatc gaccccggtc gcgcatacac gaccgeacgc gcaacgcgcg 2160 

atctactctt gtcacctatc tataatacag tc.ttctactt tgaacatcga agtfcttattg 2220 

aaacgccgag accagcaacc tacacctgca cctcggcgct caa 2263 

<210> 5<211> 252<212> DNA<213> Heliothis virescens<400> 5 
tgttccggac tgtcacatcg cgccggccta egaggtcgcg ccagcacacg tcatcgtgcg 60 

ccccacctaa gctgggccct caeca tacgc cggacccccg gacactcgct cagcgacccc 120 

ggtcgcgcat acacgaccgc acgggcaacg cgcgattttc tcttgtacat acttcaatac 180 

agtcttcttt gcaaatcgaa gtttcattga accgccgaga coat cat cct acatctggac 240 

ctcggcgctc aa 252 

<210> 6<211> 257<212> DNA<213> Heliotliis viresceiis<400> 6 
tgttccggac tgtcacatcg cgccggccta tgaggtcgcg ccagcacacg tcatcgtgcg 60 

ccccacctaa gctgggccct caccatacgc cggacccccg gacactcgct catcgacccc 120 

ggtcgcgcat acacgaccgc acgcgcaacg cgcgatctac tcttgtcacc tatctataat 180 
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acagtcttct actttgaaca tcgaagtttt attgaaacgc cgagaccagc aacctacacc 240 
tgcacctcgg cgctcaa 257 

<210> 7<2X1> 26<212> DNA<213> Heliothis virescehs<400> 7 
atacgagctg acgacacgct gggaga 26 

<210> 8<211> 30<212> DNa:<213> Heliothis virescens<400> 8 
tctgagcgta ggaggtgtgt tgttgatgtc 30 



<210> 9<21X> 24<212> DNA<213> 
gcgcgatgtg acagtccgga acag 



Heliothis virescens<:400> 9 
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